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Preface 


Important tasks must be completed on time and with guaranteed quality; that is, the 
consensus reached by system designers and users. However, for too long, important 
tasks have often been given unnecessary urgency, and people intuitively believe that 
important tasks should be executed first so that their performance can be guaranteed. 
Actually, in most cases, their performance can be guaranteed even if they are 
executed later, and the "early" resources can be utilized for other, more urgent tasks. 
Therefore, confusing importance with urgency hinders the proper use of system 
resources. In 2007, mixed criticality was proposed to indicate that a system may 
contain tasks of various importance levels. Since then, system designers and users 
have distinguished between importance and urgency. 

In the industrial field, due to the harsh environment they operate in, industrial 
wireless networks' quality of service (QoS) has always been a bottleneck restricting 
their applications. Therefore, this book introduces criticality to label important data, 
which is then allocated more transmission resources, ensuring that important data's 
QoS requirements can be met to the extent possible. 

To help readers understand how to apply mixed criticality to industrial wireless 
networks, the content is divided into four parts. First, we introduce how to integrate 
the model of mixed-criticality data into industrial wireless networks (Chap. 1). 
Second, we explain how to analyze the schedulability of mixed-criticality data under 
existing scheduling algorithms (Chaps. 2 and 3). Third, we present a range of novel 
scheduling algorithms for mixed-criticality data (Chaps. 4, 5, and 6). Finally, we 
conclude this book and discuss future research directions (Chap. 7). 

We hope that this book will inspire further research on mixed-criticality industrial 
wireless networks. 


Shenyang, China Xi Jin 
September 2022 Changqing Xia 
Chi Xu 


Dong Li 
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Chapter 1 ff) 
Introduction Geek for 


Abstract In this chapter, we first provide an overview of industrial wireless 
networks from the perspectives of industrial communication requirements and 
classical industrial wireless networks. Then, mixed criticality is introduced, and the 
role of mixed criticality in industrial wireless networks is also discussed. Last, we 
present the organization of this book. 


1.1 Industrial Wireless Networks 


The real economy with manufacturing as the core is a concrete manifestation 
of national economic strength and international competitiveness [1]. In the past 
20 years, the manufacturing industry is undergoing the fourth industrial revolution, 
which is introducing intelligence into the industrial production process, commonly 
known as intelligent manufacturing (or smart manufacturing). Intelligent man- 
ufacturing has been widely recognized as an advanced technology to improve 
production efficiency and reduce production costs. To seize the commanding heights 
of the manufacturing industry, major industrial countries are striving to improve the 
intelligence of the manufacturing industry, and have issued relevant development 
initiatives, such as Germany’s “Industry 4.0”, China’s “Made in China 2025”, 
the United States’ “Advanced Manufacturing Partnership”, the United Kingdom’s 
“Future of Manufacturing”, and France’s “Industrie du Futur” (as shown in Fig. 1.1). 
What these initiatives have in common is that they must be network-based. 

Modern manufacturing has involved a large number of elements, such as 
machines, people, raw materials, etc. In order to make these elements form a whole 
system, networks must be used in industrial systems [2], called industrial networks. 
Industrial networks connect all elements together to implement the collection of 
industrial data and the sending of control commands. These uplink and downlink 
processes constitute a closed-loop control that enables intelligent manufacturing. 
Therefore, industrial networks are one of the important infrastructures of intelligent 
manufacturing. 


© The Author(s) 2023 1 
X. Jin et al., Mixed-criticality Industrial Wireless Networks, Wireless Networks, 
https://doi.org/10.1007/978-981-19-8922-3_1 


2 1 Introduction 


Advanced Manufacturing 


Partnership Made in China 2025 
FED the United State pases China 
4 * 
rà L4 
; ; 
; , 
2010 " 2013 , 
N 2011 v 2045 œ~. 
` s ss 
s ` "ee 
\ x Se 
N ` SS 
Tenas Germany bora the United Kingdom SSeS France 
Industry 4.0 Future of Manufacturing Industrie du Futur 


Fig. 1.1 Initiatives of some industrial countries 


1.1.1 Network-Based Intelligent Manufacturing Systems 


According to a research report by MarketsandMarkets, the global intelligent 
manufacturing market size in 2021 is USD 88.7 billion, and it is expected to reach 
USD 228.2 billion by 2027. Some large international companies have turned their 
attention to intelligent manufacturing and have successfully built related industrial 
systems. In all of these, the network plays a critical role. Typical systems are as 
follows: 


* Siemens provides users with comprehensive intelligent manufacturing solu- 
tions from discrete industries to process industries. For example, in the robot 
collaboration, Siemens' SIMATIC integrator connects robots from different 
manufacturers through the industrial network PROFIBUS, and then implements 
the manufacturing process collaboratively. In the automotive industry, Siemens’ 
digital twin that has been applied in the BMW 13 production process adopts the 
industrial wireless network PROFISAFE to integrate cyberspace and physical 
space. 

* Boeing has always been at the forefront of intelligent manufacturing. Based on 
wide-area networks, it established a global concurrent engineering (GCE) and 
realized the coordinated development of the upstream and downstream of the 
industrial chain. This intelligent manufacturing paradigm reduces development 
costs by 5096. 

* Haier's intelligent flexible factory allows users on the Internet to participate in 
the whole process from product design to manufacturing. The factory opened in 
2013, and after years of continuous innovation, it now has the ability to meet the 
needs of mass customization manufacturing. In 2021, its production efficiency 
has increased by 28%, and the production capacity of a flexible production line 
is nearly double that of a traditional production line. 

* Schneider Electric puts forward the concept of green intelligent manufacturing, 
which integrates energy, automation and the Internet of Things into a holistic 
system. The system connects data terminals and the cloud management system 
through the network, so as to realize the high-efficiency and low-power manage- 
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ment of the whole process of industrial production. The company's ultimate goal 
is to create a sustainable future industry. 

* Mitsubishi Electric has built a lithium battery intelligent manufacturing line to 
support high-precision, high-efficiency and high-flexibility production. The line 
adopts a time-sensitive network to double the communication performance and 
increase the production efficiency by 5096. 


1.1.2 Industrial Communication Requirements 


Since industrial elements must be connected by industrial networks, the quality 
of service (QoS) of the networks seriously affects industrial production processes. 
Once the industrial network cannot deliver data packets to their destinations while 
meeting the communication requirements, it may lead to serious problems such as 
the decline of control performance, the stagnation of industrial production, and even 
safety production accidents. Therefore, before designing an industrial network, we 
must understand the requirements of industrial communications. 

In different application scenarios, networks must meet different requirements. 
The most common requirement is transmission speed, which is present in all 
networks, not limited to industrial networks. Civilian networks, e.g., WiFi and 
cellular networks, are constantly evolving to meet users' transmission speed 
requirements. For this kind of requirement, industrial networks can learn from 
the advanced techniques of civilian networks to solve the problems in industrial 
scenarios. In addition to this, industrial networks should meet the following special 
communication requirements: 


* Real-Time Requirement: The real-time requirement of communications means 
that data packets must be delivered to destinations before their deadlines [3]. The 
real-time requirements in different industrial scenarios are shown in Table 1.1. 
Safety production has the maximum real-time requirement, and data transmis- 
sions have to be completed within 10 ms. Long-delayed data transmissions make 
it impossible for industrial systems to respond to emergency alarms. This will 
cause irreparable losses, such as blast furnace explosion, toxic gas leakage, etc. 
In control-related scenarios, there are some differences in real-time requirements. 
Closed-loop control has higher real-time requirements, while open-loop control 
allows more relaxed real-time transmissions. In monitoring-related scenarios, 
monitoring data do not directly control industrial processes, and log updates even 
allow a delay of several hours. Therefore, monitoring-related applications have 
the minimum real-time requirements. 

* Reliability Requirement: For industrial networks, the reliability requirement 
means the reception rate of data packets that meet all other communication 
requirements. The reliability requirements of different industrial scenarios are 
shown in Table 1.2. Surgical robots have the highest reliability requirement of 
99.999999% [4]. If the control commands of the robot are not delivered correctly, 
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Table 1.1 Real-time 


: Scenario Application | Latency 
requirements 


Safety Emergency shutdown <10 ms 
Leak detection 
High-risk process control 
Control Closed-loop control 10-100 ms 
Automated shutdown 
Open-loop control 2100 ms 
Monitoring Equipment condition monitoring 
Environment monitoring 


System logs 
Table L2 Reliability Scenario Application Reliability 
requirements Safety Robotic aided surgery 99.999999% 
Robotic aided diagnosis 99.9999% 
Control Closed-loop control 
| Mobile robots 
Open-loop control 99.99% 


Monitoring | Equipment condition monitoring | 99.9% 
Environment monitoring 
System logs 


it is possible that the patient will be seriously injured. Diagnostic robots get 
information from bodies without harming the patients, so their reliability can be 
slightly reduced. In control-related scenarios, the reliability of most applications 
should reach 99.9999% [5]. However, since the performance of open-loop control 
is relatively low, their reliability requirements for industrial networks are slightly 
lower. In monitoring-related scenarios, since monitoring data packets are allowed 
to transmit for a long time, they can compensate for the low reliability with 
multiple retransmissions. Therefore, monitoring-related applications have the 
minimum reliability requirements. 

* Massive Connection Requirement: Massive connections [6] mean that a large 
number of network devices are connected to a network at the same time. The 
number of connected devices around the world shows explosive growth (as 
shown in Fig. 1.2). In 2003, there were 0.5 billion connected devices around 
the world, and the number of connected devices per person was 0.08 [7]. 
Then, in 2020, while the world's population is growing slowly, the number of 
connected devices is estimated to have increased to 50 billion, and each person 
has an average of 6.58 connected devices. According to this trend, by 2025, 
the number of connected devices per person will be 9.27 [8]. The same trend 
exists in industrial systems. In the early days of industrial networks, only dozens 
of industrial devices needed to be connected together. Then, with the growth 
of industrial applications and the improvement of intelligence, the number of 
industrial devices is increasing exponentially. At present, existing industrial 
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Fig. 1.2 Massive connection 


systems already connect tens of thousands of devices. This trend will continue, 
and the number of connected devices will be higher in the future. 


1.1.3 Classical Industrial Wireless Networks 


For the special requirements of industrial communications, industrial wired net- 
works (such as time-sensitive networks, PROFINET, Modbus TCP, etc.) are capable 
of meeting the real-time and reliability requirements, but cannot connect millions 
of devices with cables. Therefore, wired networks can only be used in small-scale 
industrial applications, or only as backbone networks. 

Compared with wired networks, wireless networks simplify the network deploy- 
ment and reduce maintenance costs. Hence, industrial applications should apply 
wireless networks to connecting devices. However, due to the inherent openness 
of the wireless environment [9], how to improve the real-time performance and 
reliability of wireless networks has always been the focus of the industry and 
scholars. In classical industrial wireless networks, the related techniques and 
algorithms are as follows: 


1.1.3.1 WirelessHART 


WirelessHART [10] is a global IEC-approved standard (IEC 62591) that specifies 
a robust and reliable wireless sensor-actuator network, and has been widely-used 
in industrial process monitoring and control. WirelessHART is built on top of 
the IEEE 802.15.4 standard, and defines star and mesh topologies (as shown 
in Fig. 1.3). It provides enabling techniques for real-time scheduling such as 
time synchronization, time division multiple access and multiple channels, etc. 
Although the WirelessHART standard excludes real-time scheduling algorithms, 
scholars have proposed many related studies. In 2010, the basic idea of multi- 
processor scheduling was introduced into WirelessHART networks [11]. Thus, in 
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Fig. 1.3 Topologies supported by WirelessHART. (a) Star topology. (b) Mesh topology 
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Fig. 1.4 Real-time scheduling model of WirelessHART networks 


the scheduling model of WirelessHART networks (as shown in Fig. 1.4), multiple 
channels correspond to multiple processors, and the time slot used for a transmission 
corresponds to the running time of a task on processors. However, links and routing 
paths are excluded in multi-processor systems. Therefore, the main difference 
between them is that network scheduling must strictly follow the sequence of links 
in a path, while multi-processor scheduling does not need to consider this. Since 
the network scheduling model is based on the multi-processor scheduling model, 
many multi-processor scheduling algorithms, such as fixed priority assignment, 
dynamic priority scheduling, preemptive scheduling, etc., have been improved to 
use in WirelessHART networks [12-15]. 

In addition, to improve the network reliability, WirelessHART adopts the graph 
routing policy [16-18] (as shown in Chap. 3.2). In a routing graph, there are multiple 
paths from each node to the destination node, so that data packets can be copied and 
sent multiple times to improve the reliability. At the same time, WirelessHART 
supports frequency hopping mechanisms to improve the anti-interference ability of 
transmissions. 
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1.1.3.2 WIA-PA 


WIA-PA (IEC 62601) [19] is also based on the IEEE 802.15.4 standard. The main 
difference from WirelessHART is that WirelessHART defines two topologies (star 
and mesh), while WIA-PA is a hybrid of star and mesh topologies (as shown in 
Fig. 1.5). In the star network, field devices upload their data to the routing device. 
Then, in the mesh network, routing devices transmit data to the gateway. Thus, the 
communication rule of WIA-PA is specified according to the hybrid topology (as 
shown in Fig. 1.6). The mesh network cannot transmit until all transmissions in the 
star network are finished. The hybrid topology can improve the network scalability 
[20, 21] and enables WIA-PA to support most of the WirelessHART strategies. 
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1.1.3.3 WIA-FA 


WIA-FA (IEC 62948) [22] and WIA-PA belong to a family of standards. The main 
difference is that WIA-FA is based on the IEEE 802.11 standard. Hence, WIA-FA 
only defines a redundant star topology (as shown in Fig. 1.7) and does not need any 
routing mechanism. To improve the real-time performance and reliability, WIA-FA 
adopts multiple access points and redundant devices. Based on these features, the 
communication of WIA-FA is segmented (as shown in Fig. 1.8). Access points are 
divided into several groups, and the transmission time is correspondingly divided 
into the same number of segments. Each group is assigned one segment. The access 
points of a group are allowed to occupy only the assigned segment. In a segment, 
first, access points broadcast beacons to field devices, and then according to the 
configuration in beacons, field devices start to work. Since the topology of WIA- 
FA is a simplification of WIA-PA and WirelessHART, some of the algorithms of 
WIA-PA and WirelessHART can be used in WIA-FA. In addition, based on multiple 
access points, a real-time scheduling algorithm for highly reliable retransmissions 
has been proposed to meet industrial communication requirements [23]. 
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1.1.3.4 ISA100.11a 


ISA100.11a (IEC 62734) [24] is also based on the IEEE 802.15.4 standard. Its 
topology is similar to that of WirelessHART (as shown in Fig. 1.9), and most 
of the techniques used in WirelessHART to improve the real-time performance 
and reliability also exist in ISA100.11a, such as time division multiple access, 
multiple channels and graph routing. Therefore, the scheduling model and relevant 
strategies of WirelessHART can also be applied to ISA100.11a. In addition, the 
design goal of ISA100.11 is different from those of the other networks. ISA100.11a 
was designed for ease of use. Flexible configuration interfaces and IPv6 are included 
in ISA100.11a. These techniques enable ISA100.11a to easily connect to other 
networks. 


1.3 Mixed Criticality 


Mixed criticality [25] is a new system characteristic, which is different from the 
real-time performance and reliability of systems. However, the methods proposed 
based on mixed criticality can guarantee the real-time performance and reliability of 
the systems with limited resources. In addition, since the mixed-criticality methods 
are aimed at limited resources, they can also be a solution to support massive 
connections. 
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1.2.1 Mixed Criticality Overview 


Mixed-criticality systems define extensions to real-time embedded systems with 
limited resources. The traditional real-time performance measures the urgency 
of tasks, not their importance. Then, mixed-criticality systems are proposed to 
highlight the importance of tasks. In such systems, important tasks are called 
high-criticality tasks, and unimportant tasks are called low-criticality tasks. Mixed- 
criticality systems can execute in multiple criticality modes, and criticality level 
is also one of the attributes of tasks. To make it clearer, in the following, a system 
model with two criticality levels (high and low) is introduced. For multiple criticality 
levels, the system model can be easily extended. 

As shown in Fig. 1.10, when the system with two criticality levels starts, it 
must be in the low-criticality mode, and all tasks are allowed to execute. During 
system execution, some internal or external dynamic disturbances may cause high- 
criticality tasks to fail to meet performance requirements. Then, the system switches 
to the high-criticality mode. In the high-criticality system mode, high-criticality 
tasks are allowed to use more resources to guarantee their real-time performance and 
reliability, while low-criticality tasks have to be discarded if there are not sufficient 
resources. In this way, the performance of high-criticality tasks can be guaranteed as 
much as possible, and then the system performance can be maximized under limited 
resources. 

For different systems, mixed-criticality models are different [26]. In this book, 
we only focus on industrial wireless networks. 


1.2.2 Mixed Criticality in Industrial Wireless Networks 


In industrial wireless networks, criticality is an inherent property of data packets 
[27, 28]. For example, Fig. 1.11 shows an industrial wireless network for cement 
manufacturing. The rotary kiln is the most important equipment. If the rotary kiln 
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Fig. 1.11 An industrial wireless network in a cement factory 


has some exceptions and its temperature data are lost or miss deadline, workers 
cannot take measures in time. This will lead to production inefficiency. By contrast, 
even if the temperature data of pre-heaters cannot be delivered to the destination 
within deadlines, the temperature of materials can be sensed in the pre-calciner. 
Hence, the temperature data of the rotary kiln are more critical than those of pre- 
heaters. In the wireless network, when the important equipment has an exception, 
its sensing data have to be quickly and reliably delivered to the control room. This 
process needs more network resources. However, the resources of wireless networks 
are very limited. When the resources cannot guarantee the requirements of all data 
packets, the packets that belong to lower-criticality levels have to be discarded. 

Based on the above basic rules of mixed-criticality wireless networks, some novel 
algorithms are needed to analyze and schedule the mixed-criticality packets so that 
as many packets as possible can be satisfied under performance constraints. The key 
feature of mixed-criticality systems is that the resources assigned to high-criticality 
tasks can be dynamically increased during system execution. Therefore, the main 
difference among the various mixed-criticality systems is what the resources are and 
how to use the resources. For example, in mixed-criticality multi-processor systems, 
the resource is the execution time of a task on processors, while, in mixed-criticality 
wireless networks, there are various types of resources, including time slots and 
channels at the data link layer, and routing path at the network layer. In this book, 
we will introduce some algorithms to solve the analyzing and scheduling problem 
of mixed-criticality data packets from the perspective of network resources. 


1.3 Book Organization 


In this book, we will see how mixed criticality helps to improve the performance 
of industrial wireless networks. The rest of this book is organized as follows: In 
Chaps. 2 and 3, we analyze the schedulability of mixed-criticality data packets under 
two classical scheduling algorithms. Chapter 4 presents a new algorithm to schedule 
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mixed-criticality data packets in time-division multiple-access networks. Chapter 5 
extends the model of Chap. 4, and uses the devices with multiple radio interfaces 
to improve the schedulability of mixed-criticality data packets. Chapter 6 addresses 
the mixed-criticality scheduling problem of industrial 5G new radio. Finally, we 
conclude this book in Chap. 7. 
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Chapter 2 A 
Schedulability Analysis of us 
Mixed-Criticality Data Under 

Fixed-Priority Scheduling 


Abstract WirelessHART, as a robust and reliable wireless protocol, has been 
widely-used in industrial systems. Its real-time performance has been extensively 
studied, but limited to the single-criticality case. Many advanced applications have 
mixed-criticality communications, where different data flows come with different 
criticality levels. Hence, in this chapter, we study the real-time mixed-criticality 
communication based on WirelessHART networks, and present an end-to-end delay 
analysis method under fixed priority scheduling. 


2.1 Background 


WirelessHART is based on a centralized network management and multi-channel 
Time Division Multiple Access (TDMA). These special features have attracted 
researchers' attentions, and they have done some studies to improve the real-time 
performance of WirelessHART networks, e.g. [1-4]. However, all these studies 
focus on the single-criticality case. Advanced real applications come with mixed- 
criticality data communications, such as the case of the cement factory in Sect. 1.2.2. 

The key difference between mixed- and single-criticality systems is that the 
criticality of data in mixed-criticality systems must be considered together with 
real-time performance [5]. This leads to the problem that directly using tradi- 
tional priority-based scheduling algorithms of single-criticality systems to mixed- 
criticality systems is infeasible due to independence between criticality and tradi- 
tional priorities [6-12]. Therefore, the traditional real-time theory needs a revision 
to support mixed-criticality networks. 

There are a few related studies on mixed-criticality networks. The work in 
[13-15] designs Network-on-Chips for mixed-criticality multiprocessor systems. 
The work in [16] proposes mixed-criticality protocols for the Controller Area 
Network (CAN), and then a response-time analysis method and an optimal priority 
assignment scheme are provided. The work in [17] designs a virtual CAN controller 
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to provide differentiated services for different criticality levels. The work in [18- 
20] focuses on the wired network—TTEthernet. They propose some scheduling 
algorithms to guarantee the performance of messages under real-time constraints. 
The work in [21] is about wireless networks. It introduces a mixed-criticality 
scheduling method to JPEG2000 video systems based on the IEEE 802.11 standard. 
However, WirelessHART networks are based on the IEEE 802.15.4 standard and are 
quite different from the wireless video system. Therefore, existing system models 
and solutions cannot be used in the WirelessHART model. 

Since the end-to-end delay analysis is the foundation of the real-time theory, 
in this chapter, we present an end-to-end delay analysis method for fixed priority 
scheduling in mixed-criticality real-time WirelessHART networks. The analysis can 
be used to test whether the data flows can meet their special requirements when 
designing a WirelessHART network. 

In the following, first, we introduce the concept of mixed criticality into real-time 
wireless sensor-actuator networks and propose a formulated system model; second, 
we propose an end-to-end delay analysis method, which is a fast feasible method 
to test the reliability of mixed-criticality systems; third, evaluation results show that 
the proposed method is very effective and only incurs little pessimism comparing 
with simulation results and a real testbed. 


2.2 System Model 


2.2.1 Mixed-Criticality Wireless Network Model 


We consider a WirelessHART network characterized by G =< V, E,m >: 


* A WirelessHART network consists of sensor/actuator nodes and a gateway with 
a centralized network manager. We use n nodes V = (vi, vo, ..., Un} to denote 
these devices, and the gateway is vı. Each node is equipped with a transmitter, 
so it cannot send and receive in the same time slot. 

e E: V xV is the set of links. Each element e;; in E represents existing reliable 
communication between v; and v;. Transmitting a packet through one link is 
called transmission. 

* We use m to denote the number of available channels. WirelessHART networks 
support 16 non-overlapping channels. However, since these channels may suffer 
from persistent external interference, not all of them can always be accessed. 
Hence, 0 « m x 16. Each channel supports only one transmission in one time 
slot. 
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The data flow set is denoted by F = (F1, F2,...}. Each flow F; is characterized 
by F; =< xi, ci.ti(x), pi, i >. pi denotes the distinct fixed priority. $; ($j € 
E) is an ordered sequence of links and denotes the routing path of the flow F;. 
The centralized manager of the WirelessHART network collects sensing data and 
distributes actuator data, so the gateway is the source or destination for each flow. 
In the TDMA policy, each time slot allows a one-hop data transmission and its 
acknowledgement to be transmitted. We use c; to denote the number of time slots 
required to deliver a packet from the source to the destination, i.e., c; is equal to the 
number of hops of the flow F;. 

xi denotes the criticality level of the flow F;. For ease of presentation, we only 
focus on a dual-criticality system, in which there are only two criticality levels L 
(low) and H (high). However, it can be easily extended to systems with an arbitrary 
number of criticality levels. Correspondingly, the network also has dual-criticality 
modes (H, L}. If the criticality level of the flow F; is not less than the current 
network mode x, it can be delivered; otherwise, the flow is discarded. The network 
starts in the low-criticality mode (x — L), in which all flows are served. When an 
error or exception occurs in a node, the node will trigger the changing of the network 
mode from low criticality to high criticality (x = H). Then only the flows with 
high-criticality level can be delivered, and the low-criticality flows are discarded. 
Note that the mode change will introduce additional time to the delay of the high- 
criticality flow and the message of mode change should be broadcast to the entire 
network as soon as possible. There are some methods used to solve this problem. For 
example, one channel of each node can be reserved to serve the message. Therefore, 
we only model the duration of the mode change as C, which is used to calculate the 
delay of the packet delivered during the mode change. 

When errors and exceptions occur, workers will handle problems and change the 
mode from high criticality to low criticality. We do not consider this process due 
to the unpredictability of workers' behavior, i.e., we do not study the mode change 
from high criticality to low criticality. The assumption is also widely adopted in 
existing works (such as [22-24] etc.). 

In mixed-criticality uniprocessor/multiprocessor systems, the execution time of 
a task is a function of the system mode. In wireless networks, the number of time 
slots required to deliver a packet is equal to the number of hops and fixed. However, 
the period f; is dependent on the network criticality mode. Since the important flow 
is more frequently delivered when the network mode is changed to high criticality, 
hence, £j (H) < tj(L). 

According to the period f; (x), the flow F; periodically releases a packet, which 
is assigned the parameters specified in F;. Our system adopts the implicit-deadline, 
i.e., the packet's relative deadline is equal to the flow's period corresponding with 
the network mode of generating the packet. For example, if the packet is released 
in the network mode L, then its relative deadline is t;(L). Therefore, in a stable 
network, at most one packet of each flow is active at any time. However, when 
the network mode is changed from L to H, there may exist two active packets 
belonging to one flow because of the change of the flow's period. In this case, the 
packet released in the network mode H has higher priority than another. 
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A packet is released at time slot sı, and arrives at its destination at time slot 
52, then its end-to-end delay is (s2 — sı + 1). The end-to-end delay of a flow is 
the maximum delay among all its packets. If a scheduling algorithm can schedule 
all flows such that all packets’ end-to-end delays are less than or equal to their 
deadlines, the flow set is called schedulable under the scheduling algorithm. 

Note that not all of the above assumptions are supported by the original 
WirelessHART protocol. However, they can be implemented in the application layer 
[1—3]. In Sect. 2.4, our real testbed is introduced. 


2.2.2 Fixed Priority Scheduling 


We focus on the end-to-end delay analysis for fixed priority scheduling, which is 
the most commonly used real-time scheduling in real systems. In fixed priority 
scheduling, transmissions are scheduled within a hyper-period 7, which is equal 
to the least common multiple of periods of all flows, since after that all schedules 
are cyclically repeated. The period supported by the WirelessHART protocol is 2, 
where i is an integer greater than or equal to 0. Therefore, the hyper-period T is 
equal to the maximum period among all flows. At each time slot of T , if there exist 
unused channels, the transmission with the highest priority is scheduled. However, 
if the released transmission shares nodes with the transmissions that have been 
scheduled at this time slot, it cannot be scheduled since one node cannot serve 
more than one transmission at one time slot (as shown in Fig. 2.1). Therefore, there 
are two factors influencing the transmission scheduling: channel contention (there 
are no unused channels assigned to the transmission) and transmission conflicts (a 
transmission cannot be scheduled, if it shares a node with a transmission that has 
been scheduled in this time slot). In other words, the two factors introduce extra 
delays. In the following, we analyze delays introduced by two factors separately. 
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Fig. 2.1 Fixed priority scheduling 
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Table 2.1 Key notations 
Symbol Definition 


G A WirelessHART network 

V Set of all devices in the network G 

E Set of links in the network G 

m Number of available channels 

F; A data flow, F; € IF 

Xi Criticality level of the flow F; 

Cj Number of hops of the flow F; 

x Criticality mode of the current network 

t; (x) Period of the flow F; at the current mode 

Di Priority of the flow F; 

Qi Routing path of the flow F; 

C Duration of the mode change 

Si The ith time slot 

T Hyper-period 

Reh Pseudo worst case delay of the flow F; for single networks 

hp(Fx) Set of flows whose priorities are higher than Fg 

R(x) Worst case delay of the packet that belongs to the flow F; at the network mode x 
r Number of hops that the packet has passed before the mode change 


Problem Statement Given the mixed-criticality WirelessHART network G, the 
flow set IF and the fixed priority scheduling algorithm, our objective is to analyze 
the end-to-end delay for each flow, such that the schedulability of the flow set can 
be determined. 

Table 2.1 summarizes the key notations used in this chapter. 


2.3 End-to-End Delay Analysis 


Our analysis is based on the EDA (End-to-end Delay Analysis) method [1], which is 
the state-of-the-art end-to-end delay analysis for fixed priority scheduling in single- 
criticality real-time WirelessHART networks. To make this chapter self-contained, 
we first introduce EDA. Then we present our end-to-end delay analysis for mixed- 
criticality WirelessHART Networks. 


2.3.1 Analysis for Single-Criticality Networks 


The EDA analysis contains two steps. The first step calculates the delay due to 
channel contention, which is called pseudo upper bound of the worst case end-to- 
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end delay and denoted by R”. Then the second step incorporates the delay due to 
transmission conflicts into the result of the first step. 


2.3.1.1 Pseudo Delay 


The flow F; experiences the worst case delay when the level-k busy period occurs. 
The level-k busy period is the maximum continuous time interval during which all 
channels are occupied by flows of priority higher than the priority of Fg, until Fk 
finishes its active packet transmitting. The notation hp(F;) is used to denote the 
set of flows whose priorities are higher than Fy. If the flow F; (F; € hp(Fx)) has 
a packet with release time earlier than the level-k busy period and deadline in the 
level-k busy period, it is said to have carry-in workload in the busy period. Then 
two types of workload are presented as follows: 


° wò C (F;, œ) denotes the workload upper bound in the level-k busy period of œ 
slots, if F; has no carry-in workload: 


NC, p. EE ; m 
Wi -(Fi,a)- 5 cj + min(@ mod t;, ci) 
i 


where t; denotes the period of F; in single-criticality networks. 
° we I (F;, œ) denotes the workload upper bound in the level-k busy period of a 
slots, if F; has a carry-in workload: 


max(a@ — cj, 0) 


Weta) = | ; 
d 


IE 


where u; = min(max(max(a@ — cj, 0) — (t; — Ri), 0), cj — 1) and R; denotes the 
worst case end-to-end delay of F; in single-criticality networks. 


Similarly, there are two types of interference between F; and Fg during o slots: 
Ij C (Fi, o) = min(W; « (Fi, o), — cy + 1) Q.1) 
IC (Fi, o) = min(WE (F;, o), e — ck + 1) (2.2) 


At most m — 1 higher priority flows have carry-in workload in the network with 
m channels. Therefore, Fx’s total delay due to channel contention is 


X@= D> I (Fi, a) 4- Urla) 
Fiehp(Fx) 


where Uz (o) is the sum of the min(|Ap(F;,)| , m —1) largest values of the differences 
IC! (Fi, œ) — INC (Fi, o) among all F; € hp( Fx). 
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The WirelessHART network contains m channels, so Eq. (2.3) shows the delay 
due to channel contention. And the pseudo upper bound Re is the minimal value 
of a that solves Eq. (2.3). a can be found using the iterative fixed-point algorithm 
[25], which is widely used in the delay analysis of real time systems. The iterative 
calculation of o starts ata = cg. During the iterations, if œ is larger than the deadline 
of the flow F;, the algorithm terminates and the flow set is unschedulable; if the 
value of o is fixed and less than the deadline, the fixed-point is R”. 


a= [2m + Ck (2.3) 
m 


2.3.1.2 Worst Case Delay 


This step incorporates the delay due to transmission conflicts into R” to calculate 
the actual end-to-end delay Rx. First, we introduce some definitions. 


e Q(k,i): the total number of F;'s transmissions that share nodes with F;'s 
transmissions. 

* ój(k, i): the number of nodes along the jth maximal common path between Fx 
and F;. 5^ (k, i) is the length of the maximal common path with a length of at 
least 4. The delay caused by a maximal common path is at most 3, so the extra 
length is specially marked using $^ (k, i). 

e A(k,i): the upper bound of end-to-end delay due to transmission conflicts that 
F; contributes to Fx, 


oO 
Atk, i) = Q.i) - Y. (5) (k, i) — 3) 
j=l 
where o is the number of maximal common paths between Fx and F;. 


Thus the upper bound of the actual delay Ry is the minimal solution of Eq. (2.4) 
by running the iterative fixed point algorithm starting at 6 = Re 


p=R}+ OM HEZ (2.4) 


FEehp(Fy)' | 
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2.3.2 Analysis for Mixed-Criticality Networks 


Mixed-criticality networks dynamically change the network mode, which results in 
three types of packets transmitted in the network: 


* The release time and deadline of a packet are all in the network mode L. The 
end-to-end delay of this packet is denoted by Ry (1L). 

* The release time and deadline of a packet are all in the network mode H. The 
notation Ri (H) is used to present the upper bound of its delay. 

* When the network mode is changed, the packet, which is released by high- 
criticality flow in the network mode L, cannot be dropped. In this situation, the 
packet's release time is in the network mode L, but its deadline is in the H mode. 
Flows formed by these packets are delivered only once. The notation F’ presents 
the set of these flows and R, (L2H) denotes the upper bound of the delay. 


Ry (L) is unaffected by the mode change and is equal to the delay Rg calculated 
in single-criticality networks. Therefore, we only analyze R&(H) and RLCL2H ). 


2.3.2.1 Analyzing R;(H) 


In the network mode H, packets belonging to the high-criticality flow are delivered, 
no matter when they are released. Therefore, Ry (#7) is interfered by the following 
two flow sets 


hpL (Fe) = (Fi|Fi € F', pi < pr, xi = H}, 
hpH(Fx) = AFi|Fi € F, pi > pk, xi = HJ. 
From these, we can derive that the delay due to channel contention is 


(v) = ba INC (Fi, a) + Uo) 
F,ehpH (Fy) OhpLl (Fy) 


where Ux (a) is also for the interferences of ApH (Fx) and hpL(Fx). Note that the 
interferences of flows in ApH (Fx) are the same with Eqs. (2.1) and (2.2), since 
the flows release packets periodically. However, the flow F; (F; € hpL(Fx)) is 
delivered only once. In the worst case, its workload in the level-k busy period is 
min{qa, cj). Therefore, 


VF; € hpL (Fx) : IF} (Fi, o) = INC (Fj, œ) = min(min(a, cj), œ — ck + 1) 
Then the pseudo upper bound R” (H) can be derived based on Eq. (2.3). 


For the delay due to transmission conflicts, similarly, besides the periodic flows 
in ApH (Fx), the flows in hpL(F;) delivered only once will introduce A(k, i) to 
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Rg(H). Therefore, 


B-RPGDE >) E In 2, 46% — Q5 


rehpiicro | 10D FehpL(Fi) 


According to Eq. (2.5), the iterative algorithm can be used to find the fixed £, i.e., 
Ry (A). 


2.3.2.2 Analyzing R, (L2H) 


The flow Fy (Fg € F’) is divided into two flows. The first flow F;;, is delivered in 
the L mode, and the second flow Fxy is in the H mode. We use R} (L) and R; (H) 
to denote the delays of Fy, and Fy 7, respectively, where r means that the packet has 
passed through r hops before the mode change, andr € [0, c; — 1]. Correspondingly, 
CkL =r and ck = cy — r. And priorities of Fy, and Fg are assigned as py. 

The calculation of Rz (L) is the same as that of Rx(L), since they are all in the 
stable network. However, Rg (H) is different from R&(H). According to our system 
model, packets released by Fy in the network mode H have higher priority than the 
packets of Fig. Therefore, the delay contributed by these higher priority packets 
must be added to R7 (H), i.e., Fey is interfered by hpL( Fy), hp H (Fx) and { Fx} in 
the network mode H. From these, we can derive 


Qu (o) = > INS (Fi, œ) + Urn (o) 
FiehpH (Fe) OhpL (FRM Fe} 


where Ugy is for hpL( Fy), hp H (Fx) and {Fz}. For the flow Fig, Fx releases higher 
priority packets periodically. The interference introduced by Fx is the same as that 
by hp H (Fx). Therefore, Eqs. (2.1) and (2.2) also can be used to calculate it. 

For the delay due to transmission conflicts, the packets released by {F} must be 
considered. Then the actual end-to-end delay is shown as follows: 


B = R (H) + > EXE );. A(kH, i) 
{Fx} 


ti (H) 
FrehpH (Fen FrehpL (Fi) 


And R; (Ħ) is also solved by the iterative algorithm. 

The range of r is from 0 to cg — 1. If r = 0, it means that the packet has been 
released but not been delivered before the network mode is changed to H. Thus, 
Ciz = O. This will cause the failure of the iterative algorithm. Therefore, if 3F; and 
Pi > Pk, starts with 1; otherwise, there is no interference for Fx and Rg (L)=0. 
If r = cx, it means that the packet has been delivered to its destination in the L 
network mode. Hence, the delay of the packet is Rx (L). 
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The delay of Fx is the sum of R} (L.) and R; (H). However, different values of r 
lead to different R; (L) and R (H). Therefore, the upper bound of F;'s delay is 


Ry(L2H) = max {R;(L)+ R,(H)) +C 
re€[0,c;—1] 


where C is the additional time introduced by the mode change (shown in Sect. 2.2). 
To sum up, if the flow Fx satisfies R&(L) < t(L), Re(H) < tk(H) and 
R&(L2H) < tf&j(L), then it is schedulable. In a flow set, if all the flows are 


schedulable, the flow set is schedulable. The calculation of our analysis is in pseudo 
polynomial time because our analysis is based on the iterative fixed-point algorithm. 


2.4 Performance Evaluations 


In this section, we will compare our analysis method with simulations and a real 
testbed. 


2.4.1 Simulations 


In order to illustrate the applicability of our method, for each parameter configu- 
ration, 100 test cases are generated randomly. For each test case, the gateway is 
placed at the center and other nodes are placed randomly in the playground area A. 
According to the suggestion in [26], the number of nodes n and the playground area 
A should satisfy 


where the transmitting range d is set as 40 m. Then, each node connects to the 
nearest node, which must be in its transmitting range and has been connected to the 
gateway. If some nodes cannot connect to the gateway, their locations are generated 
randomly again. 

The flow set F contains 0.8 - n flows. Other parameters are set as follows. We 
use the utilization u (u = Y; “if,) to control the workload of the entire network, 

VF; eF 

and UUniFast algorithm [27] is used to generate each flow’s utilization u; (u; =“ 
/;). The result generated by UUniFast algorithm follows a uniform distribution and 
is neither pessimistic, nor optimistic for the analysis. For the flow F;, its criticality 
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level is assigned randomly. If x; = H, then t;(L) = 2[loe7] and ti(H) = 2|ioe7 ; 
otherwise, t;(L) = 2[127] and t;(H) = +00. The mode change duration C is set 
as the maximum number of hops between any two nodes of the network. If one 
channel and one transmitter of each node are reserved to serve the mode change, 
the change command can be broadcast to all the nodes in the duration C. The 
fixed priority assignment follows the two classical algorithms [28]: (1) Deadline 
Monotonic (DM), in which the flow with the shorter deadline is assigned the higher 
priority; (2) Proportional Deadline monotonic (P D), in which the flow with the 
shorter subdeadline is assigned the higher priority. Subdeadline is defined for its 
deadline divided by the total number of its transmissions. 

The mode change can occur at any time slot. Hence, the simulation should list 
all cases. However, for the complex state space, the execution time of simulations is 
unacceptable. Therefore, if the execution time exceeds 30 minutes, the simulation 
is suspended and the maximum delay is chosen as the worst case end-to-end delay. 
We use pessimism ratio (the proportion of our analyzed delay to the maximum delay 
observed in simulations) and acceptance ratio (the percentage of flow sets that are 
schedulable) as the performance metrics. 

Figure 2.2 plots the pessimism ratios with different numbers of nodes. We set 
that m = 12 and u = 1. In order to make test cases simulated in an acceptable 
time, the number of nodes is only up to 110. From the figures, we can see 
that the 75th percentile of the pessimism ratios is less than 2.1 and 2.2 for DM 
and PD, respectively. In [1], the result of the state-of-the-art analysis EDA for 
single-criticality networks is 1.5 and 1.6, respectively. Compared with them, our 
analysis only introduces a small degree of pessimism, even though the mode change 
increases the complexity of the end-to-end delay analysis. Therefore, our analysis is 
highly effective. 

In order to evaluate the performance of our analysis method for the larger scale 
networks in an acceptable time, we set m — 6 and u — 1. Figure 2.3 shows the 
boxplots of the pessimism ratios under varying network sizes. From the evaluation 
results, we know that our analysis method is stable under different network sizes. 
Comparing with Fig. 2.2, Fig. 2.3 is more pessimistic. The less number of channels 
introduces more contentions, and the delay analysis is to consider the worst case 
scenario. Thus, all of additional contentions are considered in the delay analysis, but 
not all of them appear in simulations. Therefore, the analysis with fewer channels is 
more pessimistic. 

We compare the acceptance ratios of our analysis and simulations, and the 
utilization u is increased to 3.2. Figure 2.4 shows the comparison, in which AMC is 
our Analysis for Mixed-Criticality networks and S7 M is the result of simulations. 
We observe that our results are close to those of simulations. Therefore, our analysis 
method can be used to verify whether flows can meet their deadlines or not before 
implementing the real system. 
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Fig. 2.2 Pessimism ratio 
under varying network sizes 
with m — 12. (a) The priority 
assignment policy DM. (b) 
The priority assignment 
policy PD 
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Comparing Fig. 2.4a and b, we observe that the acceptance ratio of the policy PD 
is less than that of the policy DM. It is because that, compared with the policy DM, 
the policy PD introduces more interferences to the flows with short paths, which 
leads to a longer delay. Similarly, all the interferences are considered in the delay 
analysis, but not all of them appear in simulations. Therefore, in Figs. 2.2 and 2.3, 
the result of PD is more pessimistic than that of DM. 
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Fig. 2.3 Pessimism ratio 
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2.4.2 Real Testbed 


We implement a real testbed that contains three types of physical devices: the 
gateway device, routing devices and field devices. The gateway device manages 
the network and adopts a low power SoC (System of Chip) AT91RM9200 and a 
CC2420 transceiver chip. The routing device is implemented on an MSP430 and 
a CC2420. The field device is equipped with a temperature and humidity sensor 
SHT15 besides an MSP430 and a CC2420. Our testbed supports the IEEE 802.15.4 
protocol, which is the physical and MAC (Medium Access Control) layers of Wire- 
lessHART networks, and an improved WirelessHART network according to our 
requirements. The improved WirelessHART implements the specific requirements 
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Fig. 2.4 Acceptance ratio 1.06 
under varying utilizations 0.94 
with m = 6. (a) The priority 
assignment policy DM. (b) 
The priority assignment 
policy PD 
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in the application layer, and it is compatible with the original WirelessHART. 
Channel 23 is used to broadcast mode change messages and configuration messages. 
Six schedulable channels are 15-20. Additionally, six devices are configured as 
sniffers to monitor packets transmitted on the six channels. Then the sniffed packet 
with a timestamp is sent to a PC via an 8-port RS-232 PCI Express serial board. 
Figure 2.5 shows our testbed. The network is deployed in a building. For 
each parameter configuration, 100 test cases are implemented. The generation of 
configurations is the same as in simulations. The configuration message is sent to 
devices via the gateway. Figure 2.6 shows pessimism ratios in a certain scope under 
different parameters. The point of the pessimism ratio 1.4 reports the number of test 
cases, whose pessimism ratios are between 1.4 and 1.6. When the utilization is set 
as 1 and the number of channels is 12, compared with PD and DM, our average 
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pessimism ratio of 100 test cases is about 2.5 and 2.4, respectively. The result is 
more pessimistic than the simulations. It is because that real cases only cover a 
little state space. The delay observed in the real testbed is not the worst case delay, 
while our analysis focuses on the worst case. Therefore, for the end-to-end delay, 
our analysis method is more reliable than real tests. 


2.5 Summary 


Multiple criticality levels co-exist in real-life wireless networks. However, previous 
works only focus on the single-criticality network. We present an end-to-end delay 
analysis method for fixed priority scheduling in mixed-criticality WirelessHART 
networks, which can be used to determine whether all flows can be delivered to 
their destinations within their deadlines. In evaluations, we compare our analysis 
results with simulations and a testbed. The results show that the pessimism of our 
analysis is acceptable and reliable. 
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Chapter 3 (f 
Schedulability Analysis ish 
of Mixed-Criticality Data Under EDF 
Scheduling 


Abstract In this chapter, to improve the schedulability of high-criticality flows 
when the network is running, we present a supply/demand bound function analysis 
method based on earliest deadline first (EDF) scheduling. In addition, our method 
considers both source routing and graph routing. At the beginning, when the 
network is in low-criticality mode, source routing is applied. When errors or 
exceptions occur, the network switches to high-criticality mode, and network 
routing turns to graph routing to guarantee that critical flows can be scheduled. By 
estimating the demand bound for the mixed-criticality data model, we can determine 
the schedulability of industrial wireless networks. 


3.1 Background 


Graph routing [1] as an effective way to improve network reliability has been 
widely used in recent years. A network under graph routing allocates two dedicated 
time slots for each transmission; if the first transmission fails, a retransmission 
will be sent. Furthermore, the controller assigns a third shared slot on a separate 
path for another retransmission. Since graph routing is a reliable method to handle 
transmission failures, a few works have begun to focus on graph routing. The work 
in [2] presents the first worst-case end-to-end delay analysis for periodic real-time 
flows under reliable graph routing. The work in [3] studies the network lifetime 
maximization problem under graph routing. However, graph routing introduces 
great challenges for real-time analysis. Many conflicts are generated on a large 
number of transmission tasks. Obviously, the task which is more critical but has 
a low priority may miss its deadline in this situation. However, many systems 
need to guarantee high-criticality task's schedulability even though in the worst 
case. That is really very important in many scenarios such as industrial production 
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line, vehicle driving system, etc. To improve the schedulability of high-criticality 
flows when the network is running, we introduce resource analysis into mixed- 
criticality industrial wireless networks. Mixed-criticality network can improve 
the schedulability of high-criticality flows by dynamically switching the network 
criticality, and resource analysis is a major way to analyze the schedulability in 
real-time systems. Combining mixed-criticality network and resource analysis, we 
can estimate the schedulability of networks with different critical levels. 

In this chapter, we propose a novel industrial network model with EDF schedul- 
ing. Our objective is to improve the network reliability, especially for high-criticality 
flows to arrive at their destinations on time even though in the worst case. We 
analyze the network schedulability by the method of resource analysis. The network 
is reliable when the network resource supply is no less than the network upper 
demand in any length of time slot. The main challenges in our work are (1) how 
to evaluate network demand when a network switches from low-criticality mode to 
high-criticality mode and (2) how to tighten the network demand bound function to 
ensure that the analysis result is not too pessimistic. The network we focus on, in 
the beginning, works in low-criticality mode, and the flows transmit under the EDF 
policy [4] and source routing. The packets are transmitted from the source to the 
destination on the primary paths; when an error occurs or the demand changes, 
the network switches to high-criticality mode to enhance the schedulability of 
high-criticality flows. The network substitutes reliable graph routing for source 
routing. Furthermore, we present a supply/demand bound analysis method to 
analyze the schedulability of periodic flows in industrial wireless sensor networks. 
By comparing the relationship between network supply bound and demand, we 
can predict whether the network can be scheduled. The current study makes the 
following key contributions: 


1. We propose a mixed-criticality industrial network, in which network routing 
switches from source routing to graph routing when the criticality mode changes. 

2. We theoretically derive the supply/demand bound function as a novel analysis 
method for industrial wireless networks. By analyzing channel contention and 
transmission conflict, we obtain the upper-bound function of demand in any 
length of time slot. When given a network supply bound function, we can 
determine the schedulability of flows under different criticality modes. 

3. We tighten the demand bound by analyzing carry-over jobs (which are released 
but not finished at the switching slot) and discussing the number of conflicts 
between two flows. 

4. Our method can be applied for general networks. By calculating the maximum 
demand bound of networks, we can analyze network schedulability in the system 
design stage; after network deployment, the upper bound of communications can 
be obtained by our method. 
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3.2 System Model 


We consider an industrial wireless network consisting of field devices, one gateway, 
and one centralized network manager. Our network is proposed in three aspects. We 
first propose a network model that is abstracted from mainstream industrial network 
standards. Then, we introduce a mixed-criticality network. Finally, we apply EDF 
scheduling in the industrial network. 


3.2.1 Network Model 


Without loss of generality, our model has the same salient features as WirelessHART 
and WIA-PA, which make it particularly suitable for process industries: 


Time Division Multiple Access (TDMA) In industrial wireless sensor networks, time 
is synchronized and slotted. Because the length of a time slot allows exactly one 
transmission, TDMA protocols can provide predictable communication latency and 
real-time communication. 

Route and Spectrum Diversity To mitigate physical obstacles, broken links, and 
interference, the messages are routed through multiple paths. Spectrum diversity 
gives the network access to all 16 channels defined in the IEEE 802.15.4 physical 
layer and allows per-time slot channel hopping. The combination of spectrum and 
route diversity allows a packet to be transmitted multiple times, over different 
channels and different paths, thereby handling the challenges of network dynamics 
in harsh and variable environments at the cost of redundant transmissions and 
scheduling complexity [5]. 

Handling Internal Interference Industrial networks allow only one transmission in 
each channel in a time slot across the entire network, thereby avoiding the spatial 
reuse of channels. Thus, the total number of concurrent transmissions in the entire 
network at any slot is no greater than the number of available channels. 


With the above features, the network can be modeled as a graph G = (V, E, m), 
in which the node set V represents the network devices (all sensor nodes in our 
model are fixed), E is the set of edges between these devices, and m is the number 
of channels. Network routing is shown in Fig.3.1; our model supports both source 
routing and graph routing. Source outing is well known in academic research; we 
will not explore it in this article. Graph routing is a unique feature of industrial 
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Fig. 3.1 Network routing. (a) Source routing. (b) Graph routing 


wireless sensor networks. In graph routing, a routing graph is a directed list of paths 
that connect two devices. As shown in Fig. 3.1b, graph routing has a primary path 
and multiple backup paths. This provides redundancy in the route and improves the 
reliability. As stated in the standard of WirelessHART, for each intermediate node 
on the primary path, a backup path is generated to handle link or node failure on the 
primary path. The network manager allocates o dedicated slots, a transmission and 
(a — 1) retransmission on the primary path. A (o + 1)th shared slot is allocated on 
the backup path, usually œ = 2. In a dedicated slot, one channel only allows one 
transmission. However, for the case of a shared slot, the transmissions having the 
same receiver can be scheduled in the same slot. The senders that attempt to transmit 
in a shared slot contend for the channel using a carrier sense multiple access with 
collision avoidance (CSMA/CA) scheme [2]. Hence, multiple transmissions can be 
scheduled in the same channel to contend in a shared slot. For instance, the network 
manager allocates two dedicated slots for the packet transmits from node S to node 
Vj in Fig. 3.1b. After the transmissions on the primary path, a third slot is allocated 
for the packet transmits from node S to V5 as a backup path. When two backup paths 
intersect at node V3, they can avoid collision by CSMA/CA. 

It is important to note that the receiver responds with an ACK packet before 
retransmission and backup; the sender retransmits or sends a backup packet when 
it does not receive an ACK. Because ACK is a part of the transmission, we do not 
need to especially analyze the demand of ACK. 


3.2.2 Mixed-Criticality Network 


A periodic end-to-end communication between a source and a destination is called 
a flow. Network switch instruction is a part of the control flow. Because we analyze 
network total demand, we need not distinguish whether a flow is a data flow or 
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a control flow. The total number of flows in the network is n, denoted by F = 
{F\, Fo,..., Fn). Fi is characterized by < tj, di, E, cj, dj >, 1 € i € n, where t; is 
the period; d; is the deadline; & is the criticality level (we focus on dual-criticality 
network (LO, HI});& = 2, means the network allocates two slots, one transmission 
and one retransmission. Our model can be easily extended to networks with an 
arbitrary number of criticality levels (by increasing the number of retransmissions 
on the primary path); c; is the number of hops required to deliver a packet from 
source to destination. When the network mode switches to high criticality, we denote 
the total transmission hops of both the primary path and shared paths as C;; and ¢; 
is the routing path of the flow. Thus, we can describe each flow F; as follows. F; 
periodically generates a packet at its period t;, and then sends it to the destination 
before its deadline d; via the routing path $; with c; hops. 

In the beginning, messages are transmitted under source routing in low criticality. 
When an error occurs or the demand changes, the control flow will send a switch 
instruction, and the network will switch to high-criticality mode. To enhance 
network reliability, the messages are transmitted under graph routing when the 
network is running on high-criticality mode. This is an irreversible process; high- 
criticality mode will never switch back to low-criticality mode (the analytical 
method of irreversible processes is similar to criticality mode switches from low to 
high). After the switch, we are not required to meet any deadlines for low-criticality 
flows, but high-criticality flows may instead execute for up to their high-criticality 
level characters. 


3.2.3 EDF Scheduling in Industrial Networks 


In this subsection, we provide an overview of the earliest deadline first scheduling 
under industrial wireless sensor networks to analyze network schedulability. EDF 
scheduling is a commonly adopted policy in practice for real-time CPU scheduling, 
cyber-physical systems, and industrial networks [6]. In an EDF scheduling policy, 
each job priority is assigned by its absolute deadline, and the transmission is 
scheduled based on this priority. Each node in our network is equipped with a 
half-duplex omnidirectional radio transceiver that can alternate its status between 
transmitting and receiving. There are two kinds of delay in industrial wireless sensor 
networks, which can be summarized as follows: 


* Channel contention: each channel is assigned to one transmission across the 
entire network in the same slot. 

* Transmission conflicts: whenever two transmissions conflict, the transmission 
that belongs to the lower-priority job must be delayed by the higher-priority one, 
regardless of how many channels are available. It is important to note that one 
node can perform only one operation (receiving or transmitting) in each slot. 


In EDF scheduling, the priority is inversely proportional to its absolute deadline. 
We explain the operating principle of EDF scheduling in Fig.3.2. There are two 
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Fig. 3.2. An example for EDF scheduling. (a) Routing. (b) EDF scheduling 


channels(CH1 and CH2) and flows in this network. At the beginning, the priority of 
F is higher than F| since d? = 4 < dı = 5. Then the controller allocates CH1 for 
F> first. The flow with lower priority must be delayed when transmission conflict 
occurs such as F; will be delayed by F> at the 3rd time slot. At the 5th time slot, the 
second packet is generated by F with an absolute deadline 8, which is larger than 
5. Hence, the priority inversion, and CH1 are allocated to F}. 

Channel contention occurs when high-priority jobs occupy all channels in a 
time slot; a transmission conflict is generated when several transmissions involve 
a common node at the same dedicated slot, and a low-priority job is delayed by 
high-priority ones. However, for the case of shared slots, transmissions with the 
same receiver can be scheduled in the same slot. When channel contention occurs 
between backup paths, the senders on the backup path use a CSMA/CA scheme to 
contend for the channel, and a network delay will not result in this condition. For a 
network under graph routing, two paths $; and $; involving a common node may 
conflict in four conditions: 


1. ¢; is a primary path, $; is a backup path; 
2. both ¢; and $; are primary paths; 
3. both ¢; and $; are backup paths; 
4. dj is a backup path, $; is a primary path. 
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Except for condition 3, the other three conditions may generate transmission 
conflicts. Consequently, the total delay caused by these conditions depends on how 
their primary and backup paths intersect in the network. 

In a real-time system, one task is schedulable when it could be executed 
completely before its deadline. Hence, the flow could be scheduled when all the 
packets generated by the flow could arrive destination before their relative deadlines. 
Then we define the network schedulability as whether or not all flows in a network 
are schedulable. 


3.3 Problem Formulation 


Given a mixed-criticality industrial network G = (V, E, m), the flow set F and the 
EDF scheduling algorithm, our objective is to analyze the relationship between 
the maximum execution demand of the flows and network resource in any time 
interval such that the schedulability of the flow set can be determined. A successful 
method to analyzing the schedulability of real-time workloads is to use demand 
bound functions [7, 8]. We introduce this concept into industrial wireless sensor 
networks and propose two definitions as follows: 


Definition 3.1 (Supply Bound Function) A supply bound function sbf(l) is the 
minimal transmission capacity provided by the network within a time interval of 
length 1. 


Definition 3.2 (Demand-Bound Function) A demand bound function dbf(F;, 1) 
gives an upper bound on the maximum possible execution demand of flow F; in 
any time interval of length 1, where demand is calculated as the total amount of 
required execution time of flows with their whole scheduling windows within the 
time interval. 


There are methods for computing the supply bound function sbf (J) in single- 
processor systems [9, 10]—for example, a unit-speed, dedicated uniprocessor has 
sbf (D = 1. We say that a supply bound function sbf is of no more than unit speed 
if 


sbf(0) =OAVI,k > 0:sbf(l+k) —sbf(l) < k. (3.1) 


Because each channel can be mapped as one processor, the supply bound 
function sbf of the industrial network can be bounded as 


sbf (0) =OAVI,k > 0:sbf(l +k) —sbf(l) x Ch xk, (3.2) 
where Ch is the number of channels in the network. Furthermore, as a natural 


assumption of all proposed virtual resource platforms in the literature, we assume 
that the supply bound function is piecewise linear in all intervals [k, k + /]. In TDM 


40 3 Schedulability Analysis of Mixed-Criticality Data Under EDF Scheduling 
(time division multiple), the network supply bound function can be expressed as 
I 
sbf(I) = max(l mod © — © + 6,0) + lo! o, (3.3) 


where © is the period of TDM, and 4 is the length of slots allocated to the 
transmission. 
In different modes, the schedulability of the flow set is determined as follows: 


Y dbfLo(Fi, D) < sbfro(), VI 0. (3.4) 

F;ieF 

XO abfui(FiD) < sbfui(, VI > 0. (3.5) 
F;eHI(F) 


Similar to real-time scheduling, the flow set is scheduled when the network is 
satisfied by Eqs. (3.4) and (3.5). However, in contrast to real-time scheduling, there 
are two kinds of delays in industrial wireless sensor networks, channel contention 
and transmission conflicts. When a transmission conflict occurs, a high-priority job 
will influence a low priority job, and thus, the flows are not independent. 

Note that transmission conflict is a distinguishing feature in industrial wireless 
sensor networks that does not exist in conventional real-time processor scheduling 
problems. To analyze the network demand in any time interval, we must consider 
the delay caused by transmission conflicts. 

Moreover, in mixed-criticality networks, there may be some jobs that are released 
but not finished at the time of the switch to high-criticality mode; we define these 
jobs as carry-over jobs. We must analyze carry-over jobs to tighten the demand 
bound of the network. 


3.4 Demand-Bound Function of Industrial Networks 


In this section, we analyze the network demand bound function for a single- 
criticality network and mixed-criticality network. For the single-criticality network, 
we study the demand bound function from channel contention and transmission 
conflicts. On this basis, we then analyze the delay caused by carry-over jobs (the 
job that is released but not finished at the time of the switch) in the mixed-criticality 
network. Finally, we study the methods for tightening the network demand bound 
function. 
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3.4.1 Analysis of Single-Criticality Networks 


In this subsection, we study the demand bound function under a single-criticality 
network in two steps. First, we formulate network transmission conflict delay with 
path overlaps; we then analyze the network dbf. To make our study self-contained, 
we present the results of the state-of-the-art demand bound function for CPU 
scheduling [11, 12]. Assuming that the flows are executed on a multiprocessor 
platform, the channel is mapped as a processor. We can obtain the network demand 
caused by channel contention in any time interval / as 


ch 1 - I — di 
dbf" — —» d | —— +1 peill. (3.6) 
d i=1 fi 8g 


Equation (3.6) considers only the delay caused by channel contention, denoted 
as dbf (D). The jobs are conflicted when their transmission paths have overlaps. 
As shown in Fig. 3.3, the priority of the job in F; is higher than the one in F;, so the 
job in F; may be delayed by in F; at nodes V and V; to Vj (we assume the network 
is connected and do not consider the case where the path disconnects). 

Transmission conflicts are generated at the path overlaps, and the network 
requires more resources to solve the transmission conflicts. To obtain dbf (J) of 
the network, we must first study the relationship between conflict delay and path 
overlap. However, estimation transmission conflict delay by the length of the overlap 
is often a pessimistic method. As shown in Fig. 3.3, the delay is much smaller than 
the length of the path overlap. To avoid pessimistic estimation, we introduce the 
result proposed by Saifullah in [13]. The length of the kth path overlap is denoted as 
Len, (ij), and its conflict delay is Dj (ij). For the overlap as Vy — ... Vp, if there 
exists node u, w € V such that u — Vj — ... Vp — wis also on F;’s route, then 
Leny(ij) = h + 1. If only u or only w exists, then Lenj(ij) = h. If neither u nor 
v exists, then Leng (ij) = h — 1. In our example, Len, (ij) = 2, Lenz (ij) = 7 and 
D(ij) = Dij) + D5(ij), which is at most 2 + 3 = 5. Obviously, Len; (ij) is the 
upper bound of D; (ij), which means Lenj(ij) > Dx(ij). For the flow set F, the 
total delay caused by transmission conflicts A is 


A= M, Dyüjz Y; Leng(ij). (3:1) 


1<i,j<n 1<i,j<n 


By the Lemma proposed in [13], the estimation of the delay caused by overlap 
with a length of at least 4 can be tightened. We then formulate the total transmission 
conflicts between F; and Fj as 


à(ij) sij) 
AGj) = 3. Lenk(j) — Y» (Lene (ij) — 3), (3.8) 


k=1 k'=1 
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Fig. 3.3 Anexample of 
transmission delay 


where 8(ij) is the number of path overlaps, 5’(ij) is the number of paths overlap 
with a length of at least 4. Because all flows have a periodic duty, we denote T as the 
least common multiple of flow set F (because the period is an integral multiple of 2, 
T is equal to the longest period among F). Network dbf changes with time interval / 
while it slides from 0 to T. However, Lemma 3.2 proposed by Saifullah is scheduled 
under fixed priority, so the priorities of flows are variable under EDF scheduling. 
We must analyze whether Saifullah's result is suitable under EDF scheduling. We 
denote the mth job generated by F; as F7", and our objective is to estimate the delay 
caused by transmission conflicts by analyzing the number of conflicts. 


Lemma 3.1 FF and E are two jobs of flow i and j, when p and Fi ( FF € 


hp(F;) ) conflict, the job 24 will never be blocked by the job FP. However, 
FF" may be blocked by F$. 
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Proof Atthe beginning, the priority of FF is higher than E. which means d 4 <d 2: 
As Fig. 3.3 shows, two flows may conflict at Vi, and Fj; is delayed by F;. When FE 
is forwarded to V}, two jobs may conflict again. If F; * is blocked by F; Kem 


obtain gt > age. Because a > d, this contradicts with d* « d. Hence, Er 
pre. 


, We can 


will never be ada by 


We prove that F; kem ig blocked by F H through an example. We use the following 
simple flow set: F| = (c1 = 1, dı = tı = 2} and Fo = {c2 = 1, d2 = h = 3}. 

At the beginning, the priority of F. : is higher than F}, because the absolute 
deadline is 2 and 3, respectively. At time slot 2, another job is generated by Fi 
with the absolute deadline of 2. However, the absolute deadline of F} is 1, F: r is 


blocked by F}. Hence, pres can be blocked by F a, H 


Because a path is a chain of transmissions from source to destination, in 
considering the conflict delay caused by multiple jobs of F; on flow Fj, we analyze 
the number of conflicts for F; and F;. Thus, Lemma 3.2 establishes the upper bound 
of this value. 


Lemma 3.2 When Fj and F; conflict, within any time interval of length l, each job 
of F; can be blocked no more than [Z1 times, and Fj can be blocked by Fj no more 


than [A] times. 
J 


Proof Based on Lemma 3.1, we know that the priority inversion will occur in the 
process of transmission. If FF is a higher-priority job than F E , the jobs released 


after Fi must be blocked by pt until F$ is finished. If all jobs generated by F; 
$ 


lj 


satisfy d « d, where k and d are the first jobs for F; and Fj, respectively, 
in /, then there are no more than [z | jobs of F;. Beyond that, because there is no 
transmission conflict, the other jobs of Fj are not blocked by F;. Hence, F; can be 
blocked by F; no more than [ al times. The same as F;, F; can be blocked by F; no 
more than [71 times. 

oO 


By Lemmas 3.1 and 3.2, we can estimate the network demand caused by the 
transmission conflict. Based on Eq. (3.6), we obtain the upper bound of dbf (I) as 
follows: 


Theorem 3.1 Jn any time interval of length l, the demand bound function under a 
single-critical network (low-criticality mode) is upper-bounded by 


n 


1 l — di 

abfro - Y [d epa] + b» (AGj) max([- mi, p" ;». 
i=l : <i, j<n 

2 (3.9) 
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Proof Network demand is the upper bound in a time interval of length 1, which 
consists of two parts, channel contention and transmission conflict. The demand 
of channel contention is bounded by Eq. (3.6). For the demand of the transmission 
conflict, we first analyze each time conflict delay for every two paths by Eq. (3.8); 
the number of conflicts can then be obtained by Lemma 3.2. We can obtain the 
network demand of transmission conflict as 


l l 
2 (AGA) max([—1, [=1). (3.10) 


I<i,j<n J 


Hence, we can obtain the demand bound function under a single-critical network 
upper-bounded by Eq. (3.9). 
oO 


3.4.0 Analysis of Mixed-Criticality Networks 


Based on the result proposed in Sect.3.4.1, we extend the idea of a demand 
bound function to a mixed-criticality network. For illustration purposes, only a 
dual-criticality network is considered; this means that € has only two values, LO 
(low-criticality mode) and H 7 (high-criticality mode). Nevertheless, it can be easily 
extended to networks with an arbitrary number of criticality modes. We construct 
three demand bound functions: the demand bound function in low- and high- 
criticality modes (dbfr o(I) and dbfg (I) ) and the demand bound function when 
network mode switches (dbfr o»g;(1)). We analyze dbfy7(l) and dbftLo2H1(D 
under graph routing in this subsection. 

The network begins from the low-criticality level, and all flows are served and 
executed as in a single-criticality network. When errors or emergencies occur, the 
centralized network manager will trigger the switching of the network mode from 
LO to HI. In high-criticality mode, the network turns to graph routing, and the 
flows in the low-criticality level are discarded; only high-criticality flows can be 
delivered. The job that is active (released, but not finished) from a high-criticality 
flow at the time of the switch is still running under source routing; ny, is the 
number of high-criticality flows, and there are no more than ng; carry-over jobs 
that are active at the time of the switch. We define these carry-over jobs as new flows 
Fong 1) F(ngi 2) «++ F2ng;, Which have the same characters as the corresponding 
flows in F except for c and f. For the new flow Fpiny;,Cp > C(ptny,), and as an 
accidental event, t(piny,) > fp. 

When the network switches from LO to H I, the demand of carry-over jobs is 


2ng| 


zx c+ Ml Apa. (3.11) 


p=1+np1 ngrEp.qx2ngi 


3.4 Demand-Bound Function of Industrial Networks 45 


Furthermore, the flows will generate new jobs when the network switches to 
high-criticality mode. Because each node except the destination on the primary path 
generates one backup path, the total number of paths for F, is cp + 1 and the 
execution time for each backup path ck can be obtained from the network easily. 


The total execution time of F; can be denoted as Cy = cp + p» k=l p . Therefore, 
network demand for channel contention under graph routing is 


Ap |i-d 
dbfgi(l) = Z > Im + tbe] . (3.12) 
P 0 


pel 


Based on the rules of transmission conflict proposed in Sect. 3.2.3, a transmission 
conflict between two flows is generated only if there is at least one flow transmission 
on the primary path. Therefore, we analyze dbfy;(/) by studying the transmission 
conflict generated on the primary path. For F A and F7, when given dp < dq, F7 
may be delayed by F E and its backup paths. We denote the path set of Fp and its 
backup paths as 7; each path in J is denoted as p’. The upper bound delay of Fy 


caused by F5 is denoted as A(p'q). A(p'q) can be formulated as 


Cpl &(p'q) 8 (p'q) 
A(p'q) = D5 (Cd) Lenk(p'q) — D> (Lene (p'q) — 3. (3.13) 
p'=1 k=1 k'=1 


For the job on the backup path, a transmission delay occurs only when it conflicts 
with primary paths with high-priority jobs. When we reverse the priority of F x and 
F ge Eq. (3.13) is the upper bound additional demand of F H caused by F d From 
the above, the network upper bound demand function under graph routing can be 
described as 


2 NHI d; 
dbfur) = —5 (= + thal 
i=l 0 


l l 
to» (A(p'q) max{[—], [—19. (3.14) 
P q 


Ixp,qzngui 


We can then obtain dbfr 02g (1) as 


NHI 1 
dbfLoaui(l) = PX | P+ ipe, ze» 
0 


l l 
t) (A(p'q) max{[—1, [—]}). (3.15) 
tp tq 


lxp.qx2ngi 
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Because transmission on a backup path occurs only when the two previous 
attempts fail, when the transmission success rate on the primary path satisfies the 
network packet reception ratio, the sender has no need to send a backup packet. 
Hence, the network upper bound demand function in this case can be rewritten as 


NHI ld: 
dbfroani(l) = - [| : Py thel 
0 


p-i P 


l l 
£0 (A(pq) max{[—1, [—1)). (3.16) 
tp tg 


l<p,qs2ny1 


3.4.3 Tightening the Demand Bound Functions 


A loose demand bound function will lead to a pessimistic estimation of network 
schedulability. In this subsection, we tighten our demand bound functions by 
discussing the relationship between two flows and transmission conflict. 

In our previous analysis Lemma 3.2, the number of conflict jobs is a conservative 
estimation as max{[+1, [z1. However, this value can be reduced by classifying 


discussions. We divide this value into the following categories: 


e f < tj, and di < dj. 
e f < tj, and di > dj. 


When the path of F; and F; have overlaps, they may generate transmission 
conflicts. The delay caused by conflict cannot occur in each slot because the flow 
does not transmit between d and t. Obviously, when one flow works in its ideal time 
(between d and ft), there is no transmission conflict between F; and Fj. 

Condition 1 is shown in Fig. 3.4a; conflict occurs only when both F; and F; have 
job transmissions on the path. For a given /, the number of conflicting jobs can be 
expressed as 


L_ dj 
f—|(L—J] +1). (3.17) 
tj ti 
Similarly, we can obtain the number of conflicting jobs in condition 2 as 
L. dj I 
[—1(—] + D = 2f]. (3.18) 
tj ti tj 


We denote the number of conflicts as Num(ij). When we know each flow's 
routing information, the estimation of Num(ij) can be further precise. By taking 


the modulus of a we can estimate the maximum length of F;’s residual path as 
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Fig. 3.4 Classified 
discussion. (a) Condition 1. 
(b) Condition 2 o o o F; 


i 


(a) 


(b) 


Ig. The delay on this residual path is denoted as w, and we can obtain y as 
follows: 


* If F; has an overlap with F; on this residual path, y = A(|| 4 ||), where A (| [| |) 


is the delay on the residual path whose length is IE ||. 
* If F; has no overlap with Fj on this residual path, y = 0. 


The number of conflicts can be expressed as 
" lo dj 
Nam) MEE ee (3.19) 
J t 


We can then obtain the network demand bound functions as follows: 


Theorem 3.2 In any time interval of length l, the demand bound function in each 
mode can be expressed as 


n 


1 l— di = is 
dbfioQ) — — F Kx + ipa + YD GpNumpy — G.20) 


i=l 0 1<i,j<n 


0 


P Ru peg 1 
dbfro2ui() = = Y + ipe, + x» 
p-l F 
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to, QGQaqNum(pq). (3.21) 


l<p,qS2ny1 


dbfui(l) = — =| z +1 bcp] + SY) (A('g)Num(pq)). 
0 


p=1 l<p,q<2ny1 
(3.22) 


The network demand bound function is dbf (I) = max(dbfro(l), dbfro2ui(), 
dbfgu;(l), and the network can be scheduled when dbf(l) is no less than 
min {dbfLo (l), dbfro2nui1(D]. 


Proof 'The proofs of demand bound functions are similar to in Theorem 3.1. The 
difference is that we reduce the number of conflicts by classifying the discussion, 
and the demand bound functions are tightened. Because there are carry-over jobs 
at the switching time, dbfro»g;(l) must be greater than dbfg;(l). When the 
network supply in a time interval of length / sbf (I) is larger than dbfio(J), the 
network can be scheduled in low-criticality mode; when dbfroog;(l) < sbf (D < 
dbfy o(l), the network can be scheduled in high-criticality mode; when sbf (I) > 
maxí(dbfrz o(l), dbfro»gu1(1)), the network cannot be scheduled. Hence, the net- 
work can be scheduled when dbf (L) is no less than min(dbfr o (D, dbfr o2g1()]). 
I 


3.5 Performance Evaluations 


In this section, we conduct experiments to evaluate the performance of our proposed 
methods. Our method is first compared with the simulation result. We then compare 
our method with the supply/demand bound function analysis without tightening. 
To illustrate the applicability of our method, for each parameter configuration, 
several test cases are generated randomly. For each test case, the network gateway 
is placed at the center of playground area A, and the other nodes are deployed 
randomly around the gateway. According to the suggestion in [14], given the 
transmitting range d — 40m, the number of nodes n and the playground area A 
should satisfy 
n 20 


A yI 


If two nodes can communicate with each other, which means that the distance 
between two nodes is less than d, they are adjacent nodes. By repeatedly connecting 
the nearest node from the source node to the gateway, the network topology can be 
obtained. If some source nodes cannot connect to the gateway, their locations are 
generated randomly again. 


(3.23) 
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Fig. 3.5 Relationship between demand bound functions and supply bound function 


Our simulations use the utilization u to control the workload of the entire 
network. To make flow sets available, we specify the network utilization U = 
X u;(U < 1), and the UUniFast algorithm [15] is used to generate each flow’s 
utilization u; (u; — $). The result generated by the UUniFast algorithm follows a 
uniform distribution andi is neither pessimistic nor optimistic for the analysis [15]. 

Figure 3.5 is an example of the relationship between the demand bound function 
in different criticality modes and the supply bound function. In this example, 
according to the actual situation, we set the number of nodes as n = 70 and 
the number of flows as F = 20. At the beginning, with the network running in 
low-criticality mode, the demand is zero. At time slot 5, DBFLO is 72, which 
is larger than the upper bound of network supply; the network then switches to 
high-criticality mode. Considering carry-over jobs, we can calculate the demand 
in high-criticality mode from time slot 5. Because the network demand is less 
than the supply, this example is a stable network. Furthermore, Fig.3.5 reveals 
that the demand bound functions are stepwise increasing. This is because dbf (1) 
is the network demand over a period of time. When a job has enough time slots 
to transmit (e.g., a job is just released), its demand is zero and does not require 
immediate execution. With the decrease of the remaining time, the job becomes 
urgent. When the remaining time for the job is c, the job must be forwarded 
immediately; otherwise, it will miss the deadline. The job demand is then changed 
to the number of hops c. 

Figure 3.6 is the variation tendency of DBF HI with the proportion of high- 
criticality flows. Because changing the proportion of ny, does not affect network 
demand in low-criticality mode, Fig.3.6 shows the network demand only in high- 
criticality mode. Obviously, the network demand is increasing with the increasing 
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demand 


0.4 0.5 0.6 0.7 0.8 0.9 
the proportion of high-criticality flows 


Fig. 3.6 Variation tendency of DB F HT with the percentage of high-criticality flows 


proportion of high-criticality flows. At the beginning (0.4—0.6), the network demand 
increases slowly. From 0.7—0.9, the demand of the network increases rapidly. This 
is because more flows in high-criticality mode generate more transmission conflicts 
in conditions 1, 2, and 4. The network needs more resources to ensure that the job 
meets its deadline. This phenomenon is enhanced severely with increasing P. 

To analyze the correctness of our method, we compare the network schedulability 
ratio between the simulation result (denoted as MixedSim) and our method (denoted 
as MixedEDF) in Fig. 3.7. For each point in the figures, more than 100 test cases 
are randomly generated. From the figures, we can know that our algorithm can 
accurately evaluate the network schedulability ratio regardless of which parameters 
are used. Because we pessimistically estimate transmission conflicts to guarantee 
our method's reliability, the evaluation value of the network demand bound is larger 
than the actual demand. In Fig. 3.7a and b, the proportions of high-criticality flows 
are P = 0.4 and P = 0.5, respectively. With the increasing of nodes, the network 
schedulability ratio declines in both situations. However, the schedulability ratio in 
Fig. 3.7b falls faster than in Fig .3.7a. This is because the network generates more 
transmission conflicts when increasing the number of high-criticality flows. Note 
that compared with Fig. 3.7a, Fig. 3.7c has 0.1 additional utilization, so the spacing 
between the simulation curve and analysis curve is expanded. Although there are 
fluctuations between 30 to 60, our method can always bound the schedulable ratio 
(the fluctuations are caused by the randomly generated network environment). 
Because the two figures generate test cases according to the respective utilization, 
their test cases are different. When network utilization increases, the number of 
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Fig. 3.7 Relationship 
between schedulability ratio 
and the number of nodes. (a) 
U = 0.5, P = 0.4. (b) 

U = 0.5, P = 0.5. (c) 

U —0.6,P =0.4 
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Fig. 3.8 Relationship between schedulability ratio and the proportion of high-criticality flows 


hops from source to destination increases. This increases the number of potential 
conflicts. The estimation result then becomes more pessimistic. 

Figure 3.8 is the relationship between the schedulability ratio and the proportion 
of high-criticality flows. It is easy to understand that the schedulability ratio 
declines with the increasing proportion of high-criticality flows. However, the 
spacing between the two curves changes with P (small-big-small). This is because 
our method should consider the transmission conflicts in all situations to ensure 
reliability. In the beginning, there are only a few conflicts in high-criticality mode. 
With increasing high-criticality flows, the strict estimation considers each path 
overlap as a transmission conflict, which leads to larger spacing between two curves. 
When P = 0.7, the number of conflicts increases in MixedSim, which reduces the 
schedulability ratio, and then the difference becomes small. 

We illustrate the advantage of MixedDBF by comparing it with the sup- 
ply/demand bound function analysis without tightening (denoted as MixedDBF-nt) 
in Fig. 3.9. Obviously, MixedDBF is better than MixedDBF-nt regardless of the 
conditions. With increasing network utilization or proportion of high-criticality 
flows, the error of MixedDBF-nt grows faster than MixedDBF. The reason is that 
both increasing network utilization and the number of high-criticality flows will 
increase the number of path overlaps. MixedDBF tightens the delay caused by the 
transmission conflict by Eq. (3.19). With increasing overlaps, the effect of Eq. (3.19) 
will be better. Hence, the error of MixedDBF-nt grows faster than MixedDBF. 


3.5 Performance Evaluations 


Fig. 3.9 Schedulability 
comparison among 
MixedSim, MixedDBF, 
MixedDBF-nt. (a) 

U = 0.4, P = 0.2. (b) 
U = 0.5, P = 02. (c) 
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3.6 Summary 


WirelessHART adopts reliable graph routing to enhance network reliability. How- 
ever, graph routing introduces substantial challenges in analyzing the schedulability 
of real-time flows. Too much transmission load will increase conflicts and reduce 
network performance. Disaster may happen when critical tasks miss their deadlines 
in this situation. Hence, firstly, we propose a novel network model that can switch 
routing based on the criticality mode of networks. When errors or accidents occur, 
the network switches to high-criticality mode and low-level critical tasks are 
abandoned. Secondly, we analyze the demand bound of mixed-criticality industrial 
wireless sensor networks under the EDF policy and formulate network demand 
bounds in each criticality mode. Thirdly, we tighten the demand bound by analyzing 
carry-over jobs and classifying the number of conflicts to improve analysis accuracy. 
The simulations based on random network topologies demonstrate that our method 
can estimate network schedulability efficiently. 
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Chapter 4 ff) 
Mixed-Criticality Scheduling for TDMA iss 
Networks 


Abstract To improve the schedulability of mixed-criticality industrial wireless 
networks, targeted algorithms should be developed. Therefore, in this chapter, we 
propose a mixed-criticality scheduling algorithm. The algorithm supports central- 
ized optimization and adaptive adjustment. It can improve both the schedulability 
and flexibility. We conduct extensive simulations, and the results demonstrate that 
the proposed scheduling algorithm significantly outperforms existing ones. 


4.1 Background 


Since time division multiple access (TDMA) scheduling has the high predictability, 
it is widely used in industrial networks [1—3]. In mixed-criticality industrial 
wireless networks, when there are not enough resources for all data packets, the 
low-criticality data packets have to be discarded. Hence, almost all of mixed 
criticality systems must support discarding strategies [4—7]. Previous works on 
single-criticality industrial wireless networks apply centralized TDMA methods to 
guarantee the real time performance and reliability of industrial wireless networks, 
e.g. [8-13]. However, the centralized TDMA methods are inflexible and difficult to 
cope with discarding. 

Intuitively, two types of methods can be used to schedule data flows in mixed- 
criticality wireless networks. The first type is to schedule flows based on criticality 
monotonic priorities. The criticality monotonic scheduling assigns the higher 
priority to the important flows and schedules them first. However, this method 
considers the criticality as the temporality. Actually, they are not equivalent. Thus, 
the criticality monotonic scheduling algorithm is not suitable for mixed criticality 
systems. This has also been demonstrated in [14]. The second type is to use the 
algorithms that have been proposed for previous mixed-criticality systems, such as 
uniprocessor/multiprocessor systems [15-17] and networks [18-20], to solve our 
problem. However, industrial wireless networks are different from the previous 
systems. To guarantee the strict requirements on the real-time performance and 
reliability, the main problem to be solved is how to avoid the collision and inter- 
ference between parallel data flows. Mixed-criticality uniprocessor/mulitprocessor 
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systems only consider independent processors and do not have the interference 
between parallel tasks. Mixed criticality wired networks and IEEE 802.11-based 
wireless networks are based on CSMA (Carrier Sense Multiple Access) protocols, 
which are unacceptable by industrial wireless networks due to the unpredictability 
(We give more clarifications on the differences between our system and others 
in Sect. 4.2). Therefore, previous algorithms cannot be used without modification 
in mixed criticality industrial wireless networks. In this chapter, we present a 
holistic scheduling solution to guarantee the real time and reliability requirements 
of data flows in resource-constrained industrial wireless networks. Although some 
flexible and scalable MAC protocols [21, 22] are adopted to improve the real-time 
performance and reliability of networks, they are based on only local information 
and cannot optimize the whole network. Therefore, our scheduling method is 
implemented in the application layer. According to the generated schedules, each 
network node transmits or receives packets in the MAC (Medium Access Control) 
layer. The scheduling method of the application layer can manage all data flows 
based on global information. Thus, it can get the optimized solution. 

This chapter includes the following: 

First, we propose a scheduling algorithm for mixed-criticality networks. The 
scheduling algorithm not only implements the optimized global management for 
all flows, but also reserves network resources for dynamic adjustments to enhance 
the real time performance and reliability of important flows. It makes a trade-off 
between the scheduling performance and the flexibility. Performance evaluations 
demonstrate that the proposed scheduling algorithm outperforms existing ones. 

Second, we present a schedulability analysis for the proposed scheduling algo- 
rithm. We analyze end-to-end delay for flows, and determine whether they are 
all schedulable. Simulation results show that our schedulability analysis is more 
effective than existing ones. 


4.2 System Model 


Industrial wireless networks must support the strict requirements on real time 
performance and reliability. Therefore, we consider an industrial wireless network 
as follows. It consists of a gateway and some devices. We use the node set N = 
(n1, 12,...) to denote these nodes. The physical layer of our industrial wireless 
networks is specified by the IEEE 802.15.4 protocol. It supports 16 non-overlapping 
channels. However, due to external interference, not all of them can be accessed all 
of the time. We denote the number of available channels as M (1 < M < 16). Our 
network serves the flow set F = ( fi, fo,...}. Each element f; is characterized by 
< 7;, i, xi >. Each flow f; periodically generates a packet at its period 7;, and 
then sends it to the destination via the routing path II;. The relative deadline of each 
packet is equal to the period 7;, i.e., a packet is released at the time f, and it must 
be delivered to its destination before the time (t + T; + 1). In industrial wireless 
protocols, e.g. [23, 24], periods conform to the expression 
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bx (4.1) 


where a is an integer value and b is the unit-period. 

To keep consistent with related works on mixed criticality systems, our network 
also supports two criticality levels, L-crit (Low criticality) and H-crit (High criti- 
cality). The dual-criticality model can be easily extended to multi-criticality model. 
If the flow f; is important, its criticality level x; is denoted as H. Otherwise, its 
criticality level x; is L. When the system is running in the normal mode without any 
exception, all flows are delivered to their destinations within deadlines. If important 
equipment has an exception, the corresponding data must be submitted frequently 
and via two paths to avoid faults on a single path. Thus, in our system model, the 
H-crit flows have two parameter sets: the L-crit parameters < T;(L),II;(L) > 
in the normal mode; the H-crit parameters < 7;(H), 0;(H) > in the exception 
mode, and T;(H) < T;(L). II; (L) is a path that is used by the H-crit flow in 
the normal mode. II;(H) contains two paths that are used by the H-crit flow 
in the exception mode, and the two paths transmit the same packet to improve 
the reliability. In order to clearly distinguish these paths, they are denoted as 
II;(L) = (77) and I; (H) = (7;, 7; ). The path zr (and z;, 7) is the set of links 
from the source to the destination. In this chapter, we do not consider how to select 
routing paths. We assume all paths have been given before generating schedules. The 
dynamism this chapter addresses refers to using different parameters in different 
modes. Transmitting a packet through the j-th link of the path ** (or zr, 7) is 


called as the transmission tř (and T. tj;)- Each transmission has two attributes 


< ng,ng >, Which denote the transthission’s source and destination respectively. 
As the constrained resources must provide enough services to H-crit flows, the L- 
crit flows cannot be transmitted when exceptions happen. Therefore, L-crit flows 
only have a parameter set < T; (L), M; (L) >. 

To improve the reliability of industrial networks, we adopt the TDMA scheme 
in the MAC layer. The network manager, which is connected to the gateway, 
assigns a time slot and a channel offset to each transmission. A transmission only 
is scheduled at the given time slot and on the given channel offset. Packets are 
generated periodically, and the schedules of corresponding transmissions have the 
same period. The schedules with the same period are organized within a superframe 
[24]. Transmitting a packet from the source to the destination has to be done in a 
superframe. Thus, superframes repeat themselves periodically, and then flows can 
be transmitted successfully. Figure 4.1a shows a simple network, which contains 
two flows fı and f2. When the system is in normal mode, the flows use their L- 
crit parameters. Their periods are 8 time slots and 4 time slots, and their paths are 
{e52, e21} and (eos, €87, €74, €41}, where e;; denotes the link from the node n; to the 
node nj. Figure 4.1b shows their superframes with different periods. CH and T S 
denote Channel Offset and Time slot. 
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Fig. 4.1 Graph routing and superframe. (a) A network. (b) Superframes with different periods. 
(c) A hyper-frame. (d) The flow f» steal slots from the flow fi 


Two types of improper schedules will lead to transmission interference, which 
seriously affects the network reliability. The first type, called node interference, is 
that more than one transmissions uses the same node at the same time slot. Each 
node is only equipped with one transmitter. Therefore, one node cannot serve more 
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than one transmissions at the same time. The second type is called scheduling 
interference which means that more than one transmissions is scheduled at the 
same time slot and on the same channel. These overlapping transmissions cannot 
be separated. To avoid transmission interference between different superframes, 
we consider all superframes as a hyper-frame whose period is the lowest common 
multiple of all superframes. According to the period's Expression (4.1), the hyper- 
period 7 = LCM(Ti, T5,...) = der ioo Figure 4.1c shows the hyper-frame 
i€ 


of the simple example. We only consider how to schedule flows in the first hyper- 
period, since after that, all schedules are repeated periodically. The network manager 
generates all schedules under two situations: Situation 1: when the network is 
deployed; and Situation 2: when the deployment is changed. Due to the requirement 
of industrial applications being fixed, the deployment is not often changed. Thus, 
the schedules may be generated several times, but not frequently. According to this 
schedule information, it obtains the working modes of each node at every time slot, 
and then delivers them to the corresponding nodes. For the schedules in Fig. 4.1c, 
from T S1 to T S4, working modes of the node n3 are (receive, send, idle, idle}. 

When a node intends to send a transmission of L-crit flows, it waits for a 
constant time and then listens to whether its channel is used. If the channel is 
used by H-crit flows, the node discards its transmission. Otherwise, the node sends 
the transmission. Note that although the node uses the carrier sense technique to 
determine whether an L-crit transmission is discarded or not, it is different from the 
CSMA scheme. For L-crit flows, the node performs carrier sense within time slots 
of the TDMA frame. If the L-crit transmission is not discarded, it is also scheduled 
based on the TDMA scheme. When a node intends to send a transmission of H- 
crit flows, it immediately sends it at the beginning of the assigned time slot. The 
scheduling algorithm assigns the proper time slot and channel for each transmission 
and prevents H-crit transmissions from interfering with other H-crit transmissions. 
Therefore, H-crit transmissions are sent directly without checking the channel. In 
this way, the H-crit flow can steal slots from L-crit flows when it needs more 
resources to cope with exceptions [25]. Note that the H-crit flow using H-crit 
parameters is not permitted to steal slots that are used by any other H-crit flows 
even if these H-crit flows are using L-crit parameters. Figure 4.1d shows an example 
of mixed-criticality schedules. The period of the H-crit flow fı is changed from 8 
to 4, and the new path (ese, €63, e31) begins to be used. In this case, there are not 
enough time slots. The H-cirt transmission 3 — 1 (the solid line in Fig. 4.1d) steals 
the resource of the L-cirt transmission 7 — 4. Based on the stealing strategy, the 
dynamic adjustment can be supported. 

The schedulable flow set is defined as follows. When the system is in the normal 
mode, the flow set is schedulable if all flows characterized by L-crit parameters 
can hit their deadlines. When there are exceptions in the system, the flow set is 
schedulable if all H-crit flows can hit their deadlines no matter which parameters 
they are using. 
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4.3 Problem Statement 


Based on the above system model, we describe the mixed criticality scheduling 
problem as follows. Given the network and the flow set F, our objective is to 
schedule transmissions in the time slot and channel dimensions such that the flow 
set is schedulable. 

To explain the problem more clearly, we formulate the problem. as a Satisfiability 
Modulo Theories (SMT) specification. The transmission U. ; (and T; ij Ti Tj.) i is assigned 
the s; jh (and s; jth, 8 ;-th) time slot and the r}; -th (and r; th, rj ' -th) channel offset. 
Note ‘that a transmission is scheduled periodically; Therefore, the transmission uses 
all of the time slots s;; + g - T; (Vg € [0, T EA in a hyper-frame. These assignments 
must respect the following constraints. 


(a) Channel Offset Constraint. 
Vf,Vjel[lI?llxr5hzM 


For each transmission, its assigned channel offset must be in M available chan- 
nels. This expression is for transmissions in the path zr. Other transmissions 
f T and Ur in paths zr; and zr’ have the same constraint, and we omit them for 
simplicity. 


(b) Releasing Sequence Constraint. 


In a routing path, the transmission rj, j+1 is released after the transmission rj, j 
is scheduled. We still omit paths 7r; and zr. 
(c) Real Time Constraint. 


Vfi l < Siig) S TQ) 


All transmissions cannot miss deadlines. Likewise, 5; ixi and s; | have the 
NU 


same constraint. 

(d) Interference Constraint. Assigning resources to transmissions must pre- 
vent the happening of node interference and scheduling interference. We use 
ô(Ta, tp) to denote whether there exists interference between Ta and Tp, 


8 (Ta, tp) = (ta N Ty = Ø)? (Ga. Sb) ^ (ra = rp)) : na, Sb), 


where n(sa, Sb) = V (sa + h - Ta = sp + k - Tp) means whether 
Vhe[0, 37). Vke[0. $) 

the assigned time slots of tz and T, overlap each other. If the two transmissions 

do not use the same node, i.e., Tg N Tp = Ø, then they can be scheduled at 

different time slots or on the different channel offsets. Otherwise, there exists 

node interference and they cannot be scheduled at the same time slot. The 
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transmissions of the H-crit flow f; are classified into three sets 7? = UP Vje 
[1, jaz] D, T; = Gv € [1, [75] and DY = GV € [1, [17]]]. For the 
L-cirt flow fi, I; = T? = Ø, and then Vf; € F,T; = Tf UT) U TY. Thus, 
the interference constraints in the normal mode and exception mode are as 
follows. 


(d.1) Normal mode 


Vu. ue M TŽ, 8(ta,) —0 
Vf,eF 


(d.2) Exception mode 


Vfi, fg € F, xi = xs = H, Vt, € lj, Vt € Vg, (14, t5) = 0 


The mixed criticality scheduling problem is NP-hard [26]. Our SMT specification 
can be solved by some solvers, such as Z3 [27] and Yices [28]. These solvers 
can find satisfying assignments for quite many problems, and their solutions have 
been an excellent standard to evaluate the effectiveness of other methods [29]. 
However, the running time may be unacceptable for complex networks and flow 
sets. Therefore, we propose a heuristic scheduling algorithm in Sect. 4.4 to solve the 
problem. 


4.4 Scheduling Algorithm 


In this section, we first introduce how to schedule transmissions, and then, based on 
these schedules, we determine working modes of each node at every time slot. 


4.4.1 A Slot-Stealing Scheduling Algorithm 


We propose a slot-stealing scheduling algorithm based on RM (StealRM). The 
proposed StealRM optimizes the solution according to the global information, and 
permits transmissions to share the same resource when the transmissions have 
different levels of criticality. Hence, the schedules can be adaptively adjusted based 
on the requirements of H-crit flows. 

The proposed StealRM is shown in Algorithm 4.1. Each flow is assigned as the 
RM priority. If two flows have the same RM priority, the flow with the smaller ID 
has the higher priority. The transmission's priority is equal to its flow's priority. 
The set R contains all of schedulable transmissions (lines 1 and 19), and the set R’ 
denotes released transmissions at the current time slot (line 3). At every time slot f, 
we first sort elements of R’ according to the decreasing order of priorities, and rj 
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in the set R’ has the highest priority (line 4). Then, for each transmission Ta in the 
set R’, we check whether it can be scheduled at the current time slot without any 
interference (lines 7-23). Let F (75) denote the flow that the transmission Ta belongs 
to (line 6). The set p L contains the transmissions that have been scheduled at the 
time slot t and belong to H-crit flows with L-crit parameters. The sets y” and 
y correspond to those in H-crit flows with H-crit parameters and L-crit flows, 
respectively. The transmissions in the set Y’ and the transmission Ta cannot steal 
slots from each other. According to the criticality level of tg, the set Y’ is assigned 
different transmissions (lines 7—13). If the transmission Ta belongs to an H-crit flow 
with H-crit parameters, then it cannot steal slots from other H-crit transmissions 
(lines 7-8). Y and Y "L may contain the transmissions belonging to the same flow 
with Ta. These transmissions do not interfere the scheduling of ta. Thus, the set 
(vir) needs to be excluded from Y " and Y "^ (line 8). Similarly, if the transmission 
Tq belongs to an H-crit flow with L-crit parameters, then it cannot steal slots from 
any other transmissions (lines 9 and 10). If the transmission Tą belongs to an L- 
crit flow, then its slots cannot be stolen by L-crit flows and H-crit flows with L-crit 
parameters (lines 11 and 13). When there is no node interference between Ta and Y’, 
and at least one channel is idle (line 14), the transmission Ta can be scheduled at this 
current time slot. 9 (Y^) denotes the channels that have been used by Y’. However, 
if the current time slot has exceeded its deadline, the flow set is unschedulable 
(lines 15 and 16). Otherwise, the time slot and channel offset of the transmission 
Tq are assigned (line 18), and the schedulable transmission set R and the scheduled 
transmission set ý” (YE and y L) are updated (lines 19-26). 

The number of iterations of the for loop in line 2 and the for loop in line 5 is 
O(|T]) and O(|I |), respectively. The complexity of line 4, line 14 and line 21 is 
O (|l |Zog|T|), OCT |) and Og). respectively. Therefore, the time complexity of 


: : VE 
Algorithm 4.1 is O(|7 |||" 7—). 


4.4.2 Node Working Mode 


Nodes have three working modes, including transmit mode (S), receive mode (IR) 
and idle mode. We use wä ; =< S (or R), ra > to denote that at the time slot t 
the node ng transmits (or receives) H-crit flows on the channel ra. Similarly, wg ; 
denotes that the node ng serves L-crit flows. Algorithm 4.2 determines the working 
mode for each node. For each transmission, we have assigned a time slot and a 
channel offset in Algorithm 4.1. According to the assignments, the working modes 
of the sender node and receiver node of the transmission can be ebuaied (lines 
between 4 and 10). The time complexity of Algorithm 4.2 is O(| pis To 

Note that a node may serve two flows at the same time slot, but the two flows 
must have different criticality levels. Otherwise, node amare occurs. At the 
beginning of the time slot t, the node works in mode wi ,; Then, in a constant time 
if it needs to send an H-crit flow or has received a flow? it continues working in the 
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Algorithm 4.1 StealRM 


Input: the flow set F 

Output: the scheduling results Vs; and Yra; 

1: the schedulable transmission set R <— {tă Tj HIY fi € F}; 
2: for Yt € [1, T] do 

3: R' < R; 


4: sort R’ according to the decreasing order of priorities; 
5: for each a from | to |R’| do 
6: i < F (Ta); 
7: if x; == H and t € T; UT? then 
i H A 
8: Ye  U. Gingpna U YI ana — (tig 
Vhel[0. 7G) 
9: else if x; == H and Ta € I7 then 
: L H HL : 
10: Y C U F Ying U z Yr ana M Yee ana) - (tjs. Tigh 
vhel0, 7475) vhel0, rtr) 
11: else 
: L HL ; 
12: Y —( U " YT (xh) U( U » YT, (xn? 
Vhe[0. ri) vhel0, rr) 
13: end if 
14: if A (TaN Tp zz 0) and |O(Y’)| < M then 
VmeyY’ 
15: if t exceeds the deadline of f; then 
16: return unschedulable; 
17: end if 
18: Sq «— t; ra «— a random channel that is not in O(Y^); 
19: R < R — {tq}+ the next transmission of Ta; 
20: if x; == H and t, € b U T7 then 
21: Vh € [0, unir LT ODxh — YE nxn + Uab 
22: else if x; == H and Ta € I7 then 
s HL HL è 
23: Vh € [0, TI» Y Rh < YTO xh + {Ta}; 
24: else 
: T L L A 
25: Vh € [0, T Yainwxh € Y nayxh + {ta}; 
26: end if 
27: end if 
28: end for 
29: end for 


30: return Vs, and Yra; 


same bui at this time slot. Otherwise, it works in mode wL,. However, when its 
mode wL , is S, it must determine whether the assigned canus is clear or not before 
it sends hie flow. If the channel has been occupied by H-crit flows, the flow has to 
be discarded. The switch time between different modes is very short compared with 
a time slot. For example, the switch time of the transceiver CC2420 is just 200 us 
while a time slot is 10 ms. Generally, at a time slot, most nodes only serve one flow 
or are idle, while only a few nodes serve two flows. 
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Algorithm 4.2 Working mode 


Input: the scheduling results Vs; and Vr; 
Output: all we, and wi, 

1: all wh, and wh, are initiated as idle mode; 
2: for Yta € T do 


3: i «— F (14); < ng, ng > are the sender and receiver of Ta; 
4: if x; == H then 
: T: H x 
5: Yh € [0, TE) Wa sa+T;(H)xh doce S, Ta >; 
í E H : 
6: Vh € 10, T) s. enanxs € 9 Ra >; 
EE else 
7 T L * 
8: Vh € [0, TOI We sat Ti (L)xh << S, ra >; 
" T L 5 
9: Vh € [0, TI WB sa T; (L)xh << R, ra >; 
10: end if 
11: end for 


12: return all wL, and wi; 


4.5 Scheduling Analysis 


In this section, we analyze the worst case end-to-end delay for each flow and use the 
delay to test the schedulability of the flow set. If the worst case delay of all flows 
does not exceed deadlines, the flow set is schedulable. For the sake of simplicity, we 
first explain how to compute the worst case delay in single-criticality networks (in 
Sect. 4.5.1) and then extend it to mixed-criticality networks (in Sect. 4.5.2). 


4.5.1 Analyzing Method for Single-Criticality Networks 


Besides transmitting time, the end-to-end delay is introduced by the interference 
from higher priority flows. Therefore, in Sect. 4.5.1.1, we present the analyzing 
method of the total interference. In Sect. 4.5.1.2 we distinguish the different types 
of interference and compute the worst case delay. 


4.5.1.1 Total Interference 


During the time interval between the release and completion of the flow fp, all 
the active transmissions that belong to the higher priority flows may have node 
interference or scheduling interference to the flow fg. Therefore, in the worst case, 
the total interference is equal to the number of those higher-priority transmissions. 
The method of computing the workload in a period has been proposed in multi- 
processor systems [30]. The mapping between the multiprocessor system model 
and the network model has been explained in the work [31], in which a channel 
corresponds to a processor and a flow is scheduled as a task. Therefore, we propose 
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our analyzing method based on the work [30], which is the start-of-the-art analysis 
for multiprocessor systems. To make this chapter self-contained, we first simply 
introduce the method of multiprocessor systems, and then present our method. 

For the simplicity of expression, the multiprocessor system uses the same 
notations as our network model. For multiprocessor systems, the calculation of the 
worst case delay of the task fg is based on the level-k busy period (as shown in 
Definition 4.1). 


Definition 4.1 (Level-k Busy Period for Multiprocessor Systems) The level-k 
busy period is the time interval [fo, tk), in which f, is the finish time of the task 
fr, and to satisfies the following conditions: 


1. to < t, where t, is the release time of the task fk. 

2. Vt € [to, t,], at the time f, all processors are occupied by higher-priority tasks. 

3. Vt < to, 3t € [t, to], at the time fr’, at least one processor is occupied by lower- 
priority tasks. 


If there is no fo that satisfies all conditions, then tọ = t». 


The level-k busy period is determined by the workload of all higher-priority tasks. 
The set P( fg) contains the tasks with higher priority than the task fg. If the task 
fi (fi € P(fx)) has a job that is released earlier than the level-k busy period and 
its deadline is in the busy period, then the task f; has the carry-in workload in the 
level-k busy period. Otherwise, the task has no carry-in workload. The two types of 
workload are presented as follows, and the length of the level-k busy period is x. 


1. In the level-k busy period, if the task f; has no carry-in workload, the upper 
bound of its workload is 


NC = ; 
Wi (fi. x) = =| -ci + min(x mod T;, ci}, 
i 
where c; is the execution time of the task fj. 
2. If the task f; has the carry-in workload, the upper bound of its workload is 


max{x — c;, 0} 


WE! (fi, x) -| 7 


[atara 


where œ = min(max(max(x — cj, 0) — (T; — Di), 0}, cj — 1} and D; is the worst 
case delay of the task fj. 


Based on the upper bounds of workload, two types of interference of the task f; 
to the task f; are as follows: 


IP (fi, x) = min(max(W/" € (fi, x), 0), x — cx + 1), 


IC! (fj, x) = min(max(WE! (fi, x), 0), x — cy + 1). 
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Therefore, the total interference suffered by the task fg is 


Q(x, PAA), PC (fy) ' Pot , mum 


Vie PNC( fy) Vfie PCI (fj) 


where PN Cc’ fk) and pc! (fx) denotes the set of tasks without carry-in workload 
and the set of tasks with carry-in workload, respectively. In a busy period, at most 
M — 1 higher-priority tasks have carry-in workload. Therefore, the set PC? contains 
M — 1 tasks that have maximal values of IE x)— De ( fi, x). Other tasks are 
in the set PN, 

In the following, we propose our analyzing method. Industrial wireless networks 
apply strict periodic schedules based on superframes, which can reduce system 
complexity and run time overhead. While in multiprocessor systems and previous 
works about wireless networks, schedules are variable, i.e., the assigned time slots 
to a task (or a flow) are non-periodic, so our workload bounds are not the same as 
previous ones. Our workload bounds are computed with Theorem 4.1. Definition 4.2 
defines the level-k busy period in the network. 


Definition 4.2 (Level-k Busy Period for Networks) The level-k busy period is the 
time interval [to, tg), in which f; is the finish time of the flow fg and to satisfies the 
following conditions: 


1. fo < t, where t, is the release time of the flow fk. 

2. Vt € [to, t,], at the time f, all channels are occupied by higher-priority flows or 
there exists node interference between the scheduled flows and the flow fx. 

3. Vt < to, at’ € [t, to], at the time 7’, there is no node interference and at least one 
channel is occupied by lower-priority flows or idle. 


If there is no fo that satisfies all conditions, then to = t». 


Theorem 4.1 The workload bounds can be computed with 


X p 
WACC, x) = WEC, x) = BH -ci + min(x mod T;, ci}, (4.2) 
i 
where cj is the number of hops in the path mij, i.e. cj = |mi]. 


Proof of Theorem 4.1 The computation of the non-carry-in workload ww CCR x) 


is shown in Fig. 4.2a. There are E3 complete periods and a scheduling window 


(x mod 7;). In the scheduling window, at most c; workloads exist. Therefore, the 
expression of the non-carry-in workload is shown as Eq. (4.2). 

In the following, we compute we 1 as shown in Fig. 4.2b. The notations A and B 
denote the two incomplete periods, respectively. We know that A < T;, B < T; and 
A--B- (x mod 7;) or (x mod T; + T;). We discuss the two cases as follows. 


Case 1: A+ B =x mod T; We draw out the windows A and B in Fig. 4.3a. We 
consider four different value ranges of the windows A and B as shown in Table 4.1, 
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(a) 
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k— A >| -B >l time 
le x 
(b) 


(a) (b) 


Fig. 4.3 Computation of we. (a) A-- B — x mod TZ;. (D A-- B — x mod T; +T; 


in which if A > T; — Dj and B > Dij, it is Case 2. If A < T; — Dij, then there is no 
workload in A. If B > D;, then all execution time c; must be the available workload. 
In this case, the workload can also be expressed as min(B, ci}. Therefore, only if 
A < T; — Di, the workload is min(B, cj). If A > T; — Dj and B < Dij, the time 
interval 7; — Dj does not contain any workload. Therefore, the available window 
A + B is equal to (x mod 7;) — (T; — Dj). 


In Case 1, we can get that the total workload is 


x C1 43 
PES z (4.3) 


i 


The notation C1 denotes the workload in the incomplete period as shown in 
Table 4.1. It is equal to min{ B, cj] or min(x mod 7; — (T; — Dj), ci}. 


Table 4.1 The workload in Workload! A < T; — D; | A > T; — D; 
the incomplete period under - | 2 

different value ranges of A B> Di |i | Case? — 

and B B < Di |min(B,cj] |min(x mod T; — (T; — Di), ci} 


CI min{B,c;} |min(x mod T; — (T; — Dj), ci} 
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Case2: A+B =x mod T;-- T;, which is shown in Fig.4.3b In this case, there are 


[un | complete periods. In the windows A and B, at most c; + min(x mod 7;, ci} 


i 


workloads exist. Therefore, the workload of Case 2 is 


x — Ti . 
T -ci +c; +min{x mod T;, ci} 


i 


> =| .cj + min{x mod T;, ci}. (4.4) 
i 

Comparing with Eqs. (4.3) and (4.4), the upper bound of workload is Eq. (4.4). 

Since (x mod 7;) is not less than B and (x mod 7;) — (T; — Dj), equation (4.4) is 

the same as Eq. (4.2). The theorem holds. Hn 


Due to the two types of workload having the same computing formula, we do not 
distinguish them in the following and use Wi ( fj, x) to denote them. Based on the 
workload bound, the interference of the flow f; to the flow fx is 


I Cfi, x) = min{max{W; (fj, x), O}, x — ck + 1). 


Thus, the total interference suffered by the flow f; is 


Qietal P) = b (fi, x). 


Vfie PC) 


4.5.1.2 Worst Case Delay in Single-Criticality Networks 


Q(x, P(fx)) and $2 (x, Pf) denote node interference and scheduling interfer- 
ence suffered by the flow f; in the level-k busy period. If there exists a node 
interference at a time slot, the flow fg cannot be transmitted at this time slot, no 
matter how many channels are idle, i.e., the flow fy is delayed one time slot due 
to the node interference. However, only when M transmissions are scheduled at a 
time slot, does the flow fg suffer scheduling interference and is delayed for one 
time slot. In the worst case, all the node interference and scheduling interference 
will introduce a delay to the flow fp. Therefore, the worst case delay is 


(4.5) 


z Q (x, P 
^ Gs, (fg) + EE Fa 


M 


From Eq. (4.5), we know that node interference introduces more delay. Since the 
sum of node interference and scheduling interference is Qo! aly, P(fx)), so when 
as much as possible node interference occurs, the end-to-end delay is the worst case. 
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The upper bound of node interference introduced by h consecutive hops of the 
flow f; to the flow f; is computed as 


Rk i(h)= max = {|{tiy|Vtiy, y € [a, a +h], ITgz such that Tiy N Tkz A 0|]. 
Vae[l,cj—A] i i i 


Thus, the workload introduced by transmissions that have node interference is 
x . 
Wifi. x) = H : Rk ilci) + Rei (min(x mod 7;, c;}). 
I 
Then, 
It Cfi, x) = min(max(Wy (fi, x), 0), x — ck + 1}, 
and 


Qt, PO) D> Hi». 


VfiEP (fa) 


Then, we can get that the worst case delay of the flow fp in the single-criticality 
network is 


total D _on D 
Dy = Nx, PU) + pee m 


From the definition of the level-k busy period, we know that the length x is the upper 
bound of the delay D; (shown in Theorem 4.2). 


Theorem 4.2 For the flow fx and the level-k busy period, the following holds: 
x > Dy. 


Proof of Theorem 4.2 We assume by contradiction that x < D,. From the 
definition of the level-k busy period (Definition 4.2), we know that the finish times 
of the busy period and the flow fp are the same, and tọ must be less than (the first 
condition) or equal to t, (when fo does not satisfy at least one condition). If x < Dy, 
then f, < fo as shown in Fig. 4.4. It is not consistent with the definition. The above 
assumption does not hold. o 


According to Theorem 4.2, the solution of Eq. (4.6) is the upper bound of end- 
to-end delay Dx. 


(4.6) 


total = oon E 
x = Q(x, PC) + | SET PU ps P| - 


M 
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Fig. 4.4 Illustration of  e—— ——— D —————& 
Theorem 4.2 k 
| l«—— — x ———» 
| | Ck | 
| | | 
t to tk 


Equation (4.6) can be solved by the iterative fixed point search [32]. The iterative 
calculation of x starts with x = cx; until the value of x does not change. 


4.5.2 Mixed-Criticality Scheduling Analysis 


In dual-criticality networks, there are three types of worst case delay. 


1. DŁ: the worst case end-to-end delay of the L-crit flow. 

2. Di L: the worst case end-to-end delay of the H-crit flow with the L-crit 
parameter. 

3. DH : the worst case end-to-end delay of the H-crit flow with the H-crit parameter 


We use D(x, Q, c) to denote Qg (x, Q)+ 


total —Q” 
BE pM +c. The methods 


of computing these types of delays are similar. The only difference is that higher- 
priority flows they suffered are different, i.e., their interference sets Q are different. 
H-crit flows have multiple paths. These paths suffer different interference and cause 
different delays. Therefore, we use sub-flows f¥, f; and f; to distinguish them. 

If there are H-crit flows with H-crit parameters in networks, L-crit flows can 
be discarded. Therefore, when we compute the delay DŁ, all flows have L-crit 
parameters. Thus, D = D(x, OF, ci), where OF = {f*IYf*, Ti(L) < Tk(L)} 
and c; = |y. 

Similarly, for H-crit flows with L-crit parameters, the interference set is Q n Lc 
Uf. I IN f Nfl xi = HOD < TO) U UT IV f? TL) < Te(L)}. Thus, 
DES = D(x, OF" c¥), when = jn]. 

An H-crit flow with its H-crit parameter suffers the interference from H-crit 
flows with H-crit parameters. The H-crit flow has two sub-flows f; and ff. 


For these sub-flows, their interference set is oH = {f F Vina = 
H, T;(H) < TD} U(fIVf^, xi = H,T;ı(L) < TxGD)) and c, = |x|, 
e = my Thus, DË = Dœ, OF c1) and D'Y = D(x, QF, c), and then 


DE = max(D/" , Dou 

According to the above delays, the schedulability test is as follows. For the L- 
crit flow fx, if DL x TX(L), it is schedulable; otherwise, unschedulable. For the 
H-crit flow fk, if DH“ < T%(L) and DË x Tk(H), it is schedulable; otherwise, 
unschedulable. If all flows in a flow set are schedulable, the set is schedulable. 
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4.6 Performance Evaluations 


In this section, we conduct experiments to evaluate the performance of our proposed 
methods. 


4.6.1 Scheduling Algorithm 


We consider three comparison methods: (1) SMT uses the Z3 solver [27], which 
is a high-performance solver being developed at Microsoft Research and whose 
solution has been regarded as an excellent standard, to solve our SMT specification 
(Sect. 4.3); (2) noStealRM applies the RM priority and does not allow slots 
to be stolen; (3) StealCM allows slots to be stolen and applies the criticality 
monotonic priority. Our method is StealRM. The performance metric we used is 
the Schedulable ratio, which is defined as the percentage of flow sets for which a 
scheduling algorithm can find a schedulable solution. 

We randomly generate a number of test cases to evaluate these methods. For 
each test case, the number of channels M and the number of nodes |N| are given. 
According to the suggestion in the work [33], these nodes are placed randomly in 
the square area A, and A — hieu) where the transmitting range d is 40 meters. 
Except for the gateway, each node has a data flow from itself to the gateway or 
vice versa. There are two necessary schedulability conditions for flow sets: (1) the 
network utilization U is not largerthan 1; (2) the utilization of each node is not larger 
than 1. If a flow set does not satisfy the two conditions, it cannot be scheduled. Thus, 
in order to make flow sets available, we specify the network utilization U(U < 1), 
and use the method UUniFast [34] to assign the utilization u; for each flow, where 
U = Y uj. Then, if the flow set can satisfy condition (2), it is an available flow 

VfieF 
set. OU due discard it, and repeat the process until an available set is found. The 
period of each flow can be obtained according to T; = E The high-crit probability 
of the flows is controlled by the parameter p. Routing paths are selected randomly. 

In order to make test cases solvable by the Z3 solver, the parameters are set 
as p = 0.3, M = 2 and U = O.8. For each configuration, 100 test cases are 
checked using the four algorithms. Figure 4.5 shows their schedulable ratios. Our 
algorithm StealRM is close to the result of Z3. In these simple test cases, the 
method StealCM has similar results with our algorithm StealRM. Figure 4.6 shows 
the average execution time of solvable test cases in Fig. 4.5. When the number of 
nodes is 25, the execution time of the method SMT is about 16.5 minutes. We also 
use the method SMT to solve the network with 30 nodes, but cannot get the result 
within 3 hours. Except for the method SMT, the execution time of other methods is 
not more than 10 milliseconds. Therefore, from the perspective of execution time, 
heuristic algorithms are significantly more efficient than the method SMT. 
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Since the execution time of the method SMT is too long, the following 
experiments do not contain it. Figure4.7 shows the schedulable ratios of the three 
scheduling algorithms. For each point in the figure, 500 test cases are randomly 
generated. From the figure, we can know that our algorithm StealRM has the highest 
schedulable ratio no matter with which parameters, while the algorithm noStealRM 
has the worst result. Therefore, the stealing mechanism can significantly improve 
the algorithm's performance. Our algorithm StealRM has better performance than 
the algorithm StealCM, especially when the node numbers are higher. This demon- 
strates that: (1) the priority should correspond to the urgency, but not the importance, 
while the stealing mechanism reflects the importance; (2) the urgency and the 
importance have to be distinguished, except in very small networks. Comparing 
these subfigures, we observe that schedulable ratios decrease with the increases of 
p,|N|, U and M. The reasons are as follows. An H-crit flow can be regarded as two 
L-crit flows. Thus, a larger value of the parameter p leads to more flows, which are 
hard to schedule. A test case contains |N | — 1 flows. Likewise, the larger |N | makes 
scheduling hard. The network utilization U corresponds to the network workload. 
Heavy workloads lead to scheduling failures. Note that compared with Fig. 4.7a, d 
has three additional channels, but its schedulable ratios decrease. Because the two 
subfigures generate test cases according to the respective numbers of channels. Their 
test cases are different. Although the number of channels increases, the utilization is 
not changed. When the utilization U is constant, with the increase of the number of 
channels M, the packets that need to be transmitted increase. The increased packets 
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Fig. 4.7 Schedulability comparison among StealRM, StealCM and noStealRM. (a) M — 6,U — 
0.5,0 = 0.3. (b) M = 6, U = 0.5, p = 0.4. (c) M = 6,U = 0.6, p = 0.3. (d) M = 9,U 
0.5, p = 0.3 


will introduce more interference, which has a negative impact on the scheduling 
performance. Therefore, Fig. 4.7d has a lower schedulable ratio than Fig. 4.7a. 

Figure4.8 shows the average execution time of Fig.4.7. As the results are 
similar, we only show two subfigures for Fig. 4.7a, d. Compared with our algorithm 
StealRM, the algorithms StealCM and noStealRM need more time to find feasible 
solutions. Therefore, their execution time slightly increases. From the figure, we 
know that our algorithm StealRM does not introduce extra time costs. For the three 
algorithms, the execution time increases with the increase of the number of nodes, 
since more data flows need to be scheduled. 


4.6.2 Analyzing Method 


The comparison method is SingleAna, in which flow sets are tested using the 
single-criticality analysis. Our mixed-criticality analysis method is MixedAna. The 
performance metrics are the analyzable ratio (the percentage of flow sets which 
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Fig. 4.8 Average execution time. (a) M = 6, U = 0.5, p = 0.3. (b) M = 9, U = 0.5, p = 03 


1$ 18$ 
o 
8 o8 S 08 
S S 
.9 0.6 .9 0.6 
2 2 
Ñ 0.4 Ñ 0.4 
T -e-MixedAna 7 -e-MixedAna 
S 0.2 S 0.2 


-A-SingleAna --SingleAna 


o 
o 


-e-MixedAna 


analyzable ratio 
analyzable ratio 


-A SingleAna 


Fig. 4.9 Schedulability comparison among analyzing algorithms. (a) |N| = 20, M = 6, p = 0.1. 
(b) |N] = 20, M = 6, p = 0.3. (0 |N| = 20, M 29, p = 0.1. (d) |N] = 60, M = 6, p = 0.1 


are tested as schedulable by an analyzing method) and the pessimism ratio (the 
proportion of analyzed delay to the delay observed in StealRM). Figure 4.9 shows 
the comparison of analyzable ratios. For each point, 500 test cases are analyzed. 
Our method MixedAna outperforms SingleAna. The analyzable ratios decrease with 
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Fig. 4.10 Delay comparison with StealRM being used as the baseline. (a) |N| = 20, M = 6, p = 
0.1. (b) |N| = 20, M = 6, p = 0.3. (© |N| = 20, M = 9, p = 0.1. (d) |N| = 60, M = 6, p = 0.1 


the increase of these parameters. The reasons are similar to those in Fig. 4.7. The 
increases of U and M lead to more packets, and the increases of | N| and p lead to 
more flows. These will cause more interference. Thus, the analysis introduces more 
pessimism, and the analyzable ratios decrease. Figure 4.10 shows the pessimism 
ratios of experiments in Fig. 4.9. The pessimism ratios of MixedAna are less than 
2, while the pessimism ratios of SingleAna are all larger than 2. This is because 
the interference that does not exist between H-crit and L-crit flows is eliminated in 
MixedAna. 


4.7 Summary 


Mixed-criticality data flows coexist in advanced industrial applications. They share 
the network resource, but their requirements for the real time performance and 
reliability are different. In this chapter, we propose a scheduling algorithm to 
guarantee their different requirements, and then analyze the schedulability of this 
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scheduling algorithm. Simulation results show that our scheduling algorithm and 
analysis have more performance than existing ones. 
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Chapter 5 ff) 
Mixed-Criticality Scheduling Chente; 
with Multiple Radio Interfaces 


Abstract In this chapter, we introduce the nodes with multiple radio interfaces 
(MRI) into mixed-criticality industrial wireless networks. When an error occurs 
or transmission demand changes, the MRI nodes can switch their transmission 
mode, changing to a high-criticality configuration to meet the system's new demand. 
Hence, we first propose a heterogeneous MRI system model. Based on this 
model, we propose a Slot Analyzing Algorithm (SAA) that guarantees system 
schedulability by reallocating slots for each node after replacing conflict nodes 
with MRI nodes. By considering both system schedulability and cost, SAA also 
reduces the number of MRI nodes. Then, we propose a Priority Inversion Algorithm 
(PIA) that improves the schedulability by adjusting slot allocations before replacing 
conflict nodes with MRI nodes. By reducing the use of MRI nodes, PIA achieves 
better performance than SAA when the system is in the high-criticality mode. 


5.1 Background 


Due to the high real-time and reliability requirements of industrial systems, 
traditional cyber-physical systems cannot be applied directly [1—4]. To guarantee 
the requirements of ICPSs, we introduce multiple radio interface (MRI) nodes into 
ICPSs. A traditional network node is usually equipped with only one antenna, while 
an MRI node with two antennas, can both receive and send packets simultaneously 
[5-7]. By replacing conflict nodes with MRI nodes, multiple flows can transmit 
without delays. If we were to replace all the conflict nodes with MRI nodes, 
that would eliminate transmission conflicts in the system; consequently, when 
the number of MRI nodes is sufficient, we can guarantee system schedulability. 
However, considering the energy consumption and cost of MRI nodes, it is 
advantageous to be able to guarantee the network schedulability using fewer MRI 
nodes [8-12]. Hence, there are two main problems in mixed-criticality industrial 
wireless networks, (1) reducing the number of MRI nodes required to guarantee 
the network schedulability and (2) analyzing the network schedulability under 
different criticality modes. To address these problems, we propose two algorithms, 
a Slot Analyzing Algorithm (SAA) and a Priority Inversion Algorithm (PIA), to 
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improve the network schedulability. SAA first allocates slots without considering 
transmission conflicts and replaces conflict nodes with MRI nodes to guarantee 
future schedulability in low-criticality mode. In contrast, PIA first optimizes slot 
allocation to reduce the number of transmission conflicts; then, we replace conflict 
nodes with MRI nodes only when the system cannot be scheduled. The contents of 
this chapter are as follows: 


1. We first propose an algorithm to obtain the candidate node set (CNSA), which 
indicates the key nodes that affect system performance, and then propose SAA 
to improve system schedulability. SAA can guarantee that a system can be 
scheduled when it satisfies Theorem 5.1. Furthermore, SAA also reduces the 
number of MRI nodes through slot analysis. 

2. We propose PIA to reduce the number of transmission conflicts before replacing 
the overlap nodes with MRI nodes. When a flow cannot be scheduled, priority 
inversion occurs when it satisfies Theorem 5.3. 

3. We analyze system schedulability in high-criticality mode and prove that PIA 
achieves better schedulability than SAA in Theorem 5.4. 

4. Simulation results show that our algorithms can improve the schedulability of 
industrial wireless networks with MRI nodes and that PIA achieves a better 
performance than SAA in the high-criticality mode. 


5.2 System Model 


We consider a heterogeneous network consisting of field devices (both MRI nodes 
and traditional sensor nodes), one centralized controller and one gateway. In this 
section, firstly, we propose a network model, and then we introduce MRI nodes 
into mixed-criticality networks. Finally, we apply a fixed priority (FP) scheduling 
scheme in industrial wireless networks. 


5.2.1 Network Model 


The system model is based on WIA-PA [13]. There are several features as follows: 
(1) Time Division Multiple Access (TDMA); (2) Route and Spectrum Diversity and 
(3) Handling Internal Interference. The number of channels is denoted as m, and 
there are N fixed nodes in our system. 

Industrial systems have higher requirements of real-time performance and 
reliability. Here, we introduce MRI nodes into industrial systems. MRI nodes are 
promising for wireless transmissions because they can improve the average user 
spectral efficiency [14]. By mounting multiple antennas on a single sensor node, 
the node can receive and transmit simultaneously (or improve the packet acceptance 
ratio when all the antennas work on the same channel). 
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Fig. 5.1 An example 


Hence, our model includes two types of sensor nodes: traditional nodes and 
MRI nodes. Each traditional node in our system is equipped with a half-duplex 
omni-directional radio transceiver whose status can alternate between transmitting 
and receiving. In contrast, an MRI node has two working modes: (1) when the 
antennas work on different channels, the node can both receive and transmit packets 
from several paths (the transmission capacity of an MRI node depends on the 
number of antennas it has); and (2) when the antennas work on the same channel, 
the node can reduce signal interference and improve the acceptance ratio because 
several antennas receive a packet from the same flow. Each packet in our system is 
transmitted through the network under source routing. 

Figure 5.1 shows an example of an MRI node in the first mode (where antennas 
work on different channels) to address a transmission conflict in which Fj and 
F send packets to the same destination simultaneously. We introduce MRI to 
solve this issue. The flows can be scheduled by replacing V3 with an MRI node. 
MRI nodes can also be used as a method to increase the packet acceptance ratio 
when two or several antennas receive packets on the same channel. For the second 
mode of an MRI node (where the antennas work on the same channel), when the 
links around V3 have poor signal quality (such as from co-channel interference, 
intermodulation interference, spurious emissions or adjacent-channel interference), 
node V3 can switch its antenna receiving frequencies to the same channel to 
improve the acceptance ratio. However, considering that the power consumption 
and cost of MRI nodes is considerably higher than the cost of a normal node, we 
cannot deploy MRI nodes throughout the entire system. 


5.2.2 Mixed-Criticality System 


Due to the introduction of MRI nodes, our mixed-criticality system model differs 
from other models [15-19]. The flow is a periodic end-to-end communication 
between a source and its destination. There are n flows in our system, denoted 
by F = (Fi, Fo,..., Fa}. F; is characterized by < ti, di, E, cj, pj >, a number 
between 1 < i < n, where the period is t;, the deadline is dj, the criticality level is 
& (we focus on a dual-criticality system (LO, HI}) in which € = 2, meaning the 
system has two critical levels. The superframe as the lowest common multiple of 
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the flow periods is denoted as T. Initially, the system works at a low critical level, 
and the MRI nodes receive packets on different channels. The system switches to 
the high critical level to enhance system reliability, and the MRI nodes reconfigure 
their antennas to receive high critical flow packets on the same channels when the 
accident occurs. Here, c; is the number of hops from a source to a destination, 
and the routing path is p;. When the system critical level switches from low to 
high, the MRI node's work pattern changes from receiving on different channels to 
receiving on the same channels to improve the schedulability of high critical flows. 
In addition, the characteristics of high-criticality flows switch to high-criticality 
mode. In this chapter, the system reduces the sampling period of a high-criticality 
flow to Njf; to improve the sampling rate of high critical flow, where N; is a value 
that satisfies Di < N; < 1. To simplify the calculation we assume that d; = fj. 
Hence, &; satisfies F < N; < 1. We model the duration of the mode switch as y, 
which is used to calculate the delay of packets delivered during the mode change. 
In the beginning, packets are transmitted in low critical mode. To provide more 
resources, each MRI node’s antennas work on different channels. The system 
switches when the accident occurs. To enhance system reliability in high-criticality 
mode, each MRI node’s antennas are reconfigured to work on the same channel. 


5.2.3 Fixed Priority Scheduling 


We provide a summary of fixed priority scheduling to analyze the schedulability of 
systems in this subsection. FP scheduling is a commonly adopted scheme in practice 
for cyber-physical systems and real-time CPU scheduling [20]. Each job priority is 
pre-allocated by the network controller, and transmissions are scheduled based on 
this priority. 

We assume that the priority of each flow is the same as its number. That is, Fi 
has the highest priority, and F, has the lowest priority in the system. There are two 
types of delays in industrial wireless sensor systems: (1) Channel contention and (2) 
transmission conflicts, the definitions can be obtained in [21]. 

We define a system as schedulable when all the flows in a system can be 
scheduled (reach the destination before its deadline). Then the definition of network 
schedulability is whether or not all flows in a network are schedulable. It is worth 
noting that when the system is in high-criticality mode, we no longer focus on the 
schedulability of low-criticality flows. When we repeat Z experiments, and only 
z experiments the emergency flow can be scheduled, then, the schedulability ratio 
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5.3 Problem Statement 


Given the flow set F, a wireless network, and the FP scheduling policy, our objective 
is to use MRI nodes to improve the schedulability of mixed-criticality industrial 
networks. We first analyze the schedulability of mixed-criticality industrial wireless 
networks. Initially, the system is working in low-criticality mode. To improve 
system schedulability, we introduce MRI technology to provide more resources (the 
resources such as slots and channels are increased since MRI nodes can receive and 
send packets simultaneously). Hence, we replace nodes at several intersections with 
MRI nodes and propose a Priority Inversion Algorithm to guarantee that the system 
can be scheduled. However, when the accident occurs, we must guarantee high- 
criticality flows schedulability; consequently, the system switches to high-criticality 
mode, and the transmission mode of MRI nodes changes. We then analyze the 
schedulability of our method to evaluate the quality of PIA. The challenges in this 
situation are listed below. 


1. When the system is deployed, we can easily determine flows that miss their 
deadlines. However, each flow's schedulability is interrelated with others. Thus, 
how should we decide which nodes should be replaced with MRI nodes? 

2. As described in the previous section, the power consumption and cost of an MRI 
node are much higher than those of a normal node. Therefore, it is unreasonable 
and not cost-effective to deploy many MRI nodes when the system can be 
scheduled. Hence, the problem of determining the smallest number of MRI 
nodes that can meet the system requirements in low-criticality mode is another 
challenge. 

3. The MRI nodes’ transmission mode changes when the system switches to high- 
criticality mode. Therefore, the scheduling algorithm also needs to consider 
schedulability in high-criticality mode. 

4. We need to analyze the system's schedulability under our proposed method. 


5.4 Scheduling Algorithms 


In this section, we first study the issue of how to improve system schedulability with 
a small number of MRI nodes. First, we identify the nodes that may be replaced with 
MRI nodes to define candidate nodes as follows: 


Definition 5.1 (Candidate Node) We define a candidate node as a node at which 
transmission conflicts (intersection or overlap nodes) can occur. As Fig. 5.1 shows, 
the paths of F; and F> intersect at V3; consequently, a transmission conflict may 
occur at this node. Thus, V3 is a candidate node. 


Flows may conflict when they have path overlaps. To facilitate candidate node 
identification, we assume that, at most, one part of a path overlaps between two 
flows. As Fig. 5.2 shows, two periodic flows transmit in a network that conflict at 


86 5 Mixed-Criticality Scheduling with Multiple Radio Interfaces 


Fig. 5.2 The analysis of 
conflicts. (a) Routing. (b) 
Superframe. (c) Scheduling 
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the second and third slots in the first period. There are two methods for addressing 
transmission conflicts. The first method is to reallocate the slots for each node 
after replacing the conflict nodes with MRI nodes (as Fig. 5.2 show, the system 
can be scheduled when we replace node V5 with an MRI node, where CH is the 
abbreviation of Channel); the other method is to adjust slot allocation, as shown 
in Fig. 5.3. By rationally allocating each flow's transmission slots, we can reduce 
the number of MRI nodes and the signal interference caused by the coexistence 
of multiple channels, which can also improve the schedulability when the wireless 
network is in high-criticality mode. However, this is unsuitable for a system that 
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Fig. 5.3 An example for — — ——5 the 15! period of F; «——— — — 


-schedulabl flict 
un-schedulable conflic cH [24 4-5 |5-7 |1-9 |2-4 
CH2 


M — the 1* period of F) .€— — — — —— 


CH1 
CH2 /1-3 |3-5 |5-6 |6-8 |8-9 


cannot be scheduled (in this example, the system cannot be scheduled if the deadline 
of F; is 4). 

Therefore, for a system in which each link’s transmission slot has been allocated, 
we propose the slot analyzing algorithm to improve system schedulability under 
a traditional FP policy. When the system cannot be scheduled by MRI nodes, 
SAA re-allocates system resources to guarantee system schedulability (increase the 
transmission speed of unscheduled data flows). By considering the characteristics 
of mixed-criticality systems and MRI nodes, we then propose the PIA algorithm, 
which adjusts slot allocations before replacing intersection nodes with MRI nodes 
(the second method to address transmission conflict). By optimizing slot allocation, 
PIA can guarantee system schedulability with fewer MRI nodes. To guarantee 
the schedulability of the network, PIA will also re-allocate slots when the system 
cannot be scheduled. It is important to note that initially both the SAA and PIA 
algorithms work in low-criticality mode. We will analyze the schedulability of these 
two algorithms later. 


5.4.1 Finding Candidate Nodes 


A network consists of numerous nodes. In this subsection, we study how to select 
candidate nodes to guarantee the system can be scheduled. Transmission conflicts 
occur at the path overlaps of flows. As Fig. 5.4 shows, there are two types of overlaps 
(without considering the flow’s direction). 


Fig. 5.4 Transmission 
conflict 
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Lemma 5.1 When flow paths have an overlapping region, the overlap nodes are 
candidates for MRI nodes. We denote the candidate node set as ^, and the node set 
on each flow's path can be denoted as Ài. 


Proof Transmission conflict can obviously only occur at an overlapping region. We 
can solve this issue by MRI nodes. We can account for the conflicts caused by the 
first type as shown in Fig. 5.1. For the transmission conflict caused by the second 
type, as Fig. 5.4 shown, there are two nodes (V 1 and V2) on both F1 and F> involved 
in the second type of overlap. When this type of transmission conflict occurs, we can 
improve the schedulability by replacing both V1 and V2 with MRI nodes. Hence, 
when flow paths have an overlapping region, the overlap nodes are candidates for 
MRI nodes. o 


Hence, we propose a Candidate Node Searching Algorithm (CNSA) to search 
for the set of candidate nodes as follows: 


Algorithm 5.1 Candidate node searching algorithm 
Input: F; 
Output: the candidate node set A = {A;}, i € F; 
1: for each flow F; do 
if node n; is the overlap node then 
ng join A; 

end if 
end for 
return A; 


DA BY ty 


We search the candidate nodes by traversing the flow paths of the entire system. 
Nodes on more than one flow path are added to the candidate node set A. The time 
complexity of CNSA is O(F?). 


5.4.2 Slot Analyzing Algorithm 


After obtaining the candidate node set, we reduce the number of nodes in this set 
to reduce the system's cost. Because not all the candidate nodes can experience 
transmission conflicts, we propose the slot analyzing algorithm to analyze the 
schedulability of the network under a traditional FP policy in one superframe (a 
superframe is the lowest common multiple of fj, i € F). SAA improves system 
schedulability by replacing some of the nodes in A with MRI nodes. 

Since SAA is used after the CNSA, we do not consider the slot allocations for 
each flow; we just replace conflict nodes with MRI nodes when a transmission 
conflict occurs. 

When the system cannot be scheduled, we need to improve the schedulability 
using MRI nodes. Because MRI nodes can both receive and send packets in a single 
time slot, we first allocate slots for each node without considering transmission 
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conflicts (the network controller allocates the channel for each transmission using 
the FP scheduling policy). Then, we replace conflict nodes with MRI nodes to 
guarantee system schedulability. Obviously, it is both unnecessary and not cost- 
effective to replace all the nodes in the candidate node set with MRI nodes. 
Therefore, SAA reduces the number of MRI nodes in A through slot analysis. The 
pseudo code of the SAA algorithm is as follows: 


Algorithm 5.2 Slot analyzing algorithm 


Input: the characters for each flow F;; the candidate node set A = {A;}, i € F; 
Output: the schedulability of the network; 
1: reallocate the slot for each node. 


2: for each flow i do 
3:  ifthe flow cannot be scheduled then 
4: find the intersection nodes and reallocate slots for F; without considering transmission 
conflicts. 
5 else 
6: retain the original allocation. 
7 end if 
8: end for 
9: for each node A; € A do 
10: if A; has two or more than two transmissions in the same time slot in one superframe then 
11: A; needs to be replaced; 
12: endif 
13: end for 


14: if the system also cannot be scheduled then 
15: for each unscheduled flow do 


16: enhance the unscheduled flow’s transmission speed; 

17: if A; has two or more than two transmissions in the same time slot in one superframe 
then 

18: A; needs to be replaced; 

19: end if 

20: end for 

21: end if 

22: update A; and A = (Ai; 

23: return A; 


We reallocate the transmission time slots for each node by the schedulability of 
each flow (lines 1—8). If the flow cannot be scheduled, we find the intersection nodes 
and reallocate the slots for this flow without considering transmission conflicts. 
Otherwise, we retain the original allocation. Then, we analyze the transmission slot 
for each node in A (lines 9—23). When a node in A has more than one transmission 
at the same time slot, that means a transmission conflict occurs at this node, and 
we need to replace the node with an MRI node (lines 9-13). When the system 
also cannot be scheduled, SAA then re-allocates slots to accelerate the transmission 
speed of unscheduled flows until the flows can be scheduled (lines 14—21). Finally, 
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SAA updates 4; and returns A—the set of nodes that need to be replaced (lines 
22-23). The theorem is as follows: 


Theorem 5.1 A network can be scheduled with SAA when the number of channels 
is no less than the number of flows (m > n). 


Proof When k flows conflict at node A, we denote the flow with the highest priority 
as F|, and the flow with the lowest priority as Fg. Fi transmits first and cannot be 
delayed at node A. The other flows must wait on F|,consequently, this generates 
delay. When the number of channels is no less than the number of flows, no delay 
is caused by transmission conflicts. Hence, flow F;,i € k will miss its deadline 
and cannot be scheduled when cj + del; > dj, where del is the delay slots. Using 
SAA, we can eliminate the delays caused by transmission conflicts. k flows can 
transmit simultaneously when m > n because the number of hops in each flow's 
transmission c is no larger than its deadline d. Thus, we can guarantee all the flows 
can be scheduled and, therefore, the system can be scheduled using SAA when the 
number of channels is no less than the number of flows. o 


5.4.3 Priority Inversion Algorithm 


In this subsection, we study how to reduce the number of MRI nodes by the 
second method, the PIA scheduling algorithm, which performs optimal allocation 
of resources before the system runs. Through optimal allocation, PIA can decrease 
the number of transmission conflicts at intersection nodes and further reduce the 
number of required MRI nodes. As Fig. 5.5 shows, V5 does not need to be replaced 
by an MRI node when d; — 6 in Fig. 5.2a. 

Initially, PIA allocates slots based on the traditional FP policy. If the system 
cannot be scheduled, the network controller obtains each flow's arrival time at its 
destination, which can be denoted as r;. Then, we can obtain the following theorem, 


Theorem 5.2 When F; and Fj conflict at node set V = Vi, V2...,Vn (i < j), 
we denote the length of the overlap as Len(ij), and its conflict delay is D(ij). The 
schedulability of F; is unaffected by the priority inversion of F; and F; when it 
satisfies:(1) r; + D(ij) < di;(2) V is the last part of the overlapping nodes of F;. 
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Proof The priority of F; is higher than that of F; based on the assumption that 
i < j. When transmission conflict occurs at node set V = Vj, V2..., Vn, Fj must 
be delayed by F;. We can accelerate the transmission speed by transmitting Fj first 
when the schedulability of F; does not be changed. We introduce the result proposed 
by Saifullah in [22], in which the upper bound A(i, j) of the delay that F; can 
experience from an instance of F; is 


à(ij) (ij) 
AGj) = J Lenk(j) — 3 ^ (Lene (ij) — 3), (5.1) 


k=1 k'zl 


where 3 (ij) is the number of path overlaps, Len; (ij) is length of the kth overlap on 
F; and F}, 5'(ij) is the number of path overlaps larger than 3. Hence, the problem 
is transformed into one of calculating the delay F; must bear while waiting for F; 
to transmit. By simplifying Eq. (5.2), we can obtain the delay in V as follows: 


Len(ij), Len(ij) « 4, 


D(ij) = i: e 
Len(ij) —3, Len(ij) => 4. 


(5.2) 


When V is the last part of the overlap nodes of F;, no other factors can cause 
additional delay to F;. Hence, the schedulability of F; is unaffected by the priority 
inversion of F; and F; when it satisfies:(1) r; + D(ij) < dj; and (2) V is the last 
part of the overlap nodes of F;. o 


Based on Theorem 5.2, PIA accelerates the unscheduled flow until it can 
be scheduled or no longer influences the other flow's schedulability. However, 
Theorem 5.2 is not suitable for all overlaps. When F; involves several parts of 
overlaps, we can only guarantee the acceleration of the low priority flow on the 
last overlap. For the flow's intersections with F; in the other parts of the overlaps, 
we define an Effective Overlap Region(EOR) as follows: 


Definition 5.2 (Effective Overlap Region) When F; and F; conflict in node set 
V = Vi, V2..., Vp (i < J), the overall node set V is the effective overlap region. 
If F; and F; have an overlap but do not conflict, then V is not an effective overlap 
region. 


Hence, Theorem 5.2 can be extended as follows: 


Theorem 5.3 When F; and Fj conflict at node set V = Vi, V2...,Vn (i < j), 
the schedulability of F; is unaffected by the priority inversion of F; and F; when it 
satisfies:(1) rj + D(ij) < di; (2) it does not change the character of non-EORs after 
V; and (3) V is the last EOR of Fi. 


Proof Based on Theorem 5.2 we extend priority inversion to the last part of the 
EOR of F;. When the operation satisfies (1) r; + D(ij) < dj; (2) it does not change 
the character of non-EORs after V ; and (3) V is the last part of the EOR of F;. Then, 
the schedulability of F; and its intersection flows are unaffected by the wait for F; 
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to transmit. Hence, when F; and F; conflict at node set V = Vi, V2..., Vh (i < j), 
the schedulability of F; is unaffected by the priority inversion of F; and F; when it 
satisfies:(1) r; + D(ij) < di; (2) it does not change the character of non-EORs after 
V; and (3) V is the last part of the EOR of F;. o 


To guarantee the second constraint condition in Theorem 5.3, we denote the 
number of overlaps after the last EOR in F; as oj. For any flow k, when Fg and 
F; have a path overlap after the last EOR in V, we denote this overlap path as 
Oik. Hence, the fact that the second condition in Theorem 5.3 can be translated 
into the priority inversion operation does not affect the schedulability of F; and F;. 
Obviously, Fy and F; cannot conflict when the packets generated by the two flows 
arrive such that oj; satisfies 


Oik 


lr? — rz*l > Dij, (5.3) 
where p is the time at which the packet generated by F; reaches oj, (it is easy to 
obtain oj; from the network controller). When the arrival time difference of F; and 
Fy is larger than Dij, oig is not an EOR. By updating p. i € F, wecan determine 
whether to invert the priority of F; and F;. 

Based on the above discussion, PIA can reduce the number of transmission 
conflicts before adding MRI nodes to the system. Furthermore, PIA can also 
improve the schedulability of networks under a high-criticality mode. When the 
system cannot be scheduled under PIA, the nodes at flow intersections will be 
replaced by MRI nodes to guarantee the schedulability of the network. The pseudo 
code of PIA is as follows, 


Algorithm 5.3 Priority inversion algorithm 


Input: the characters for each flow F;; the candidate node set A = {A;}, i € F. 
Output: reduce the number of transmission conflicts by optimal slot allocations; 
1: reallocation slots for each node by FP. 

2: the number of MRI nodes t = 0. 

3: while the system cannot be scheduled do 

4 for each flow i do 

5 obtain each flow's oj, Len and the corresponding oj, i, k € F; 
6: endfor 

7 for each unscheduled flow j do 

8 if flow i satisfies Theorem 5.3 then 


9: invert the priority between F; and Fj; 
10: else 
11: replace the overlap nods with MRI nodes; 
12: T +++; 
13: end if 
14: end for 


15: end while 
16: return T; 
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PIA first allocates slots under the FP policy (line 1) and initializes the number 
of MRI nodes (line 2). Then, it judges whether the system can be scheduled or 
not. If the system can be scheduled, it returns t, otherwise, PIA accelerates the 
transmission speed of unscheduled flows by Theorem 5.3. When the flows do not 
satisfy the conditions, PIA replaces the overlap nodes with MRI nodes, similar to 
SAA, and increments t by one (lines 3-16). 

When the system performs the priority inversion operation at the last EOR, the 
conflict is removed. Hence, that part of the overlap is no longer an EOR. The system 
repeats this process until either the system can be scheduled or no flow satisfies 
Theorem 5.3. 


5.4.4 Algorithm Analysis in High-Criticality Mode 


In this subsection, we analyze the schedulability of SAA and PIA. Since both SAA 
and PIA can be scheduled by adding MRI nodes in low-criticality mode, we analyze 
only the schedulability in high-criticality mode. 


Theorem 5.4 PIA has a higher schedulability than SAA when the system is in high- 
criticality mode. 


Proof When the system switches to high-criticality mode, low-criticality flows are 
abandoned and the high criticality flow period changes to N;t;, A <N <l. 

Because t; = dij, the flow must arrive at its destination before N; ud. We analyze 
the schedulability by a discussion of classification. The flows are classified into two 
categories: (1) those in which the flow does not overlap other high-criticality flows 
and (2) those in which the flow overlaps other high-criticality flows. For F; in the 
first category, the schedulability of both SAA and PIA are identical. When a flow is 
not blocked by other flows it can be scheduled when 


ri € Widi. (5.4) 


For Fj; in the second category, F; is delayed by other flows in D(j) slots. Then, 
F; can be scheduled when 


rj + D) € Njdj. (5.5) 
We denote the number of MRI nodes in SAA and PIA as 1944 and r?/4, 
respectively. Because PIA reduces the number of transmission s Before 


replacing intersection nodes with MRI nodes, pe >T H 1^. By Theorem 5.2, we 
can obtain 


DG) = > DG, (5.6) 
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i iA Len(ij), Len(ij) < 4 -— 
when F; and F; conflict, D(ij) = , Which is 
(Len(ij) -3, Len(ij) > 4 


determined only by the length of the overlap. Then, we can obtain the relationship 
between SAA and PIA as follows: 


AEDU sre POG. pu) 


Hence, PIA has a higher schedulability than SAA when the system is in high- 
criticality mode. o 


5.5 Performance Evaluations 


We evaluate the performance of our proposed methods by experiments. We compare 
our approaches with the traditional FP algorithm without MRI nodes. We compare 
both the acceptance rate and the number of MRI nodes for each criticality mode. We 
use the acceptance rate to represent the schedulability of a network. When all flows 
can be scheduled, the acceptance rate is 1; otherwise, it is 0. To control the workload 
of the entire system, the simulations use the UUniFast algorithm, which can make 
the flows neither pessimistic nor optimistic for the analysis [23]. All algorithms 
are implemented in C language. These programs run on a Windows machine with 
3.2 GHz CPU and 8 GB memory. Some simulation parameters are summarized in 
Table 5.1. 


5.5.1 Low-Criticality Mode 


We first compare the performances of the algorithms in low-critical mode. As 
Fig. 5.6(a) shown, the system acceptance rate is decreased by the FP scheduling 
policy (n=15, N=50, m=16), which occurs because the idle resources decrease as 
the system utilization increases. The latency tolerance of a packet is reduced along 


Table 5.1 Simulation 
parameters 


Parameter | Description 


n The number of flows 

N The number of nodes 

m The number of channels 
U Network utilization 

ui Flow i's utilization 

t The period of flow 

d The deadline of flow 


c The number of transmission hops 
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Fig. 5.6 Relationship 1.2 
between the acceptance fide pie ee os EE EUR ET d EF rur 
rate/the number of MRI 


nodes and system utilization. 
(a) Acceptance rate. (b) The 
number of nodes 


Accept Ratio 
o o 
o co 


0.4 
=è -SAA/PIA 
0.2 
FP 
0 
0.2 0.3 0.4 0.5 0.6 0.7 
Utilization 
(a) 
20 —Ħ— Candidate Nodes —96 -MRI Nodes in SAA 


o x= MRI Nodes in PIA 

© 

o 

c 

Pa 

o 

ES 

v 

a 

E 

3 

c 

ov 

<= 

£ 

0.2 0.3 0.4 0.5 0.6 0.7 
Utilization 

(b) 


with the idle resources. The network can be scheduled under both SAA and PIA 
in any situation by increasing the number of MRI nodes when m > n. Figure 5.6b 
shows the relationship between the number of MRI nodes and utilization. Obviously, 
both SAA and PIA can reduce the number of MRI nodes under the premise that 
the system is schedulable. Because PIA optimizes slot allocation before the node 
replacement operation, PIA uses fewer MRI nodes than SAA. However, none of the 
curves have an obvious tendency because system utilization involves not only the 
flow period (t) but also the number of transmission hops (c). We need to regenerate 
the transmission path for each flow to satisfy the system utilization requirements. 
Hence, the number of candidate nodes goes up and down. Because an MRI node is 
selected from the candidate node set, the number of MRI nodes is always less than 
the number of candidate nodes. 

We repeat this simulation for the situation in which all flows can be scheduled 
as shown in Fig. 5.7. The number of candidate nodes is fixed when we increase 
system utilization by only adjusting the flow period (because there is only one test 
in this simulation, and the flow transmission paths do not vary). Initially, no MRI 
nodes exist in the system because they can be scheduled without any MRI nodes. 
However, when we increase the system utilization, transmission conflict occurs. 
To guarantee the schedulability of the system, we need to add MRI nodes to the 
system. In addition, the number of MRI nodes required by PIA is always less than 
the number required by SAA. This occurs because by optimizing slot allocation, 


96 5 Mixed-Criticality Scheduling with Multiple Radio Interfaces 


Fig. 5.7 Relationship 8 
between the number of MRI 


nodes and system utilization 6 
E —ÀHRÉ—— HR eee 


=Ħ= Candidate Nodes M 
PE 


the number of nodes 
A 


=@ -MRI Nodes in SAA e^ ' 
- 
MRI Nodes in PIA ~~ 
T 
2 e-.-.e 
4 
7 
0 é* 
0.2 0.3 0.4 0.5 0.6 0.7 
Utilization 

Fig. 5.8 Relationship 12 
between the acceptance 14 er . FONDOS 
rate/number of MRI nodes " | E T ae a | 
and the number of flows. (a) $ 08 
Acceptance rate. (b) The z 0.6 
number of nodes o 

9 0.4 =@ -SAA 

< —— FP 
0.2 PIA 
0 


10 11 12 13 14 15 16 17 18 
the number of flows 


(a) 
30 
—m— Candidate Nodes 
n 25 
o = -MRI Nodes in SAA 
3 20 
< MRI Nodes in PIA 
'6 15 
tæ n - 
2 10 =ar 
E 
g5 
o 
$0 


10 11 12 13 14 15 16 17 18 
the number of flows 


(b) 


PIA can reduce the number of transmission conflicts. Hence, the number of MRI 
nodes required by PIA is always less than the number required by SAA. 

The relationship between the acceptance rate/number of MRI nodes and the 
number of flows is shown in Fig. 5.8 (U=0.3, N50, m=16). The acceptance rate 
is reduced along with the number of flows under the FP scheduling policy. The 
acceptance rates of SAA and PIA are reduced when the number of flows becomes 
larger than the number of channels. This result occurs because when m « n, delays 
are caused by channel contention. PIA achieves a better performance than SAA 
when n > 16 because it uses fewer MRI nodes. In addition, both the number of 
candidate nodes and the number of MRI nodes increase as the number of flows 
increases. This result occurs because the number of intersections increases when 
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the number of flows increases, causing more transmission conflicts in the system. 
Thus, the system requires more MRI nodes to guarantee its schedulability, and the 
number of MRI nodes always satisfies v" > 1544 > 1P/A, 

Figure 5.9 shows the relationship between the acceptance rate/number of MRI 
nodes and the number of nodes (U=0.3, n=15, m=16). All these results indicate 
that SAA and PIA can guarantee the schedulability of the system. The number of 
MRI nodes in both SAA and PIA is no larger than the number of candidate nodes 
regardless of the conditions. In addition, when the number of nodes increases to 90, 
the number of MRI nodes in PIA and SAA is identical because PIA transmutes into 
SAA when it cannot resolve the conflict. 


5.5.2 High-Criticality Mode 


When the system switches to high-critical mode, the number of MRI nodes is no 
longer our main concern (the number of MRI nodes is the same as in low-critical 
mode). Instead, we are more concerned with the performances of SAA and PIA in 
high-critical mode. 

Figure 5.10 shows the relationship between the system acceptance rate and 
system parameters such as the number of nodes and the number of flows. In these 
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Fig. 5.10 Relationship between the acceptance rate and system parameters in high-critical mode. 
(a) Acceptance rate. (b) The number of nodes 


conditions, we set X = 0.9 > max{+, i € F).If max{+, i € F} > 0.9, we will 
regeneration the system. Figure 5.10 illustrates (as Theorem 5.4 proves) that the 
acceptance rate in PIA is always better than in SAA. 

Figure 5.11 shows the relationship between the system acceptance rate and sys- 
tem utilization. To study the relationship between these two elements, we increase 
system utilization by adjusting only the flow period. Subsequently, we obtain the 
same result as Fig. 5.10, which also verifies the correctness of Theorem 5.4. 


5.6 Summary 


In this chapter, we first introduce MRI nodes into mixed-criticality networks. Then, 
we analyze the transmission paths and obtain the candidate node set. Next, based 
on the characteristics of MRI nodes, we propose SAA and PIA to guarantee 
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Fig. 5.11 Relationship between the number of MRI nodes and system utilization in high-critical 
mode 


the network schedulability in low-criticality mode. By considering system cost, 
these two algorithms help to reduce the number of MRI nodes used. Finally, we 
analyze the schedulability of these two algorithms when the system switches to 
high-criticality mode. The simulation results show that our scheduling algorithms 
perform better than existing scheduling policies. 
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Chapter 6 A) 
Mixed-Criticality Scheduling on 5G New — «x 
Radio 


Abstract Compared to industrial wired networks, 5G can improve device mobility 
and reduce the cost of networking. However, the real-time performance and 
reliability of 5G NR (new radio) still need to be improved to satisfy industrial 
applications' requirements. In factories, the main factor that affects the performance 
of 5G NR is the unstable signal quality caused by high temperatures and metal. 
Although assigning dedicated resources to all transmissions and retransmissions 
is an effective method to improve the performance of 5G NR, the unstable signal 
quality causes the resources required for retransmissions to be uncertain. To address 
the problem, we introduce the mixed-criticality task model to 5G NR. When high- 
criticality packets cannot be transmitted, they are allowed to preempt the resources 
shared with low-criticality packets. The mixed-criticality scheduling problem of 
5G NR is NP-hard. We formulate it as an OMT (optimization modulo theories) 
specification and propose a scheduling algorithm based on bin packing methods 
to make 5G NR satisfy industrial applications’ requirements. Finally, we conduct 
extensive evaluations based on an industrial 5G testbed and random test cases. 
The evaluation results indicate that our algorithm makes communication reliability 
greater than 99.9% on unlicensed spectrum, and for most test cases, our algorithm 
is close to optimal solutions. 


6.1 Background 


Ultra reliable low latency communication (URLLC) is one of main application areas 
defined in 5G. It aims to provide low latency and ultra-high reliability for mission- 
criticality services, which widely exist in industrial systems. Since the performance 
of URLLC is comparable to some wired networks, industrial systems are adopting 
5G in place of wired networks to improve the mobility of devices and reduce the 
cost of networking [1-3]. 

For URLLC, all transmissions and retransmissions of industrial data must be 
assigned the resources of 5G NR (new radio), in advance, by a scheduling algorithm 
that runs in the base station. The resources of 5G NR include time slots and 
frequency bandwidth. In factories, high temperatures, high humidity, and metal 
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seriously affect signal qualities. Thus, more retransmissions are needed to guarantee 
the reliability of industrial communications. However, on the one hand, the signal 
quality is dynamic and unpredictable. Before data packets are actually transmitted, 
the scheduling algorithm cannot determine how many retransmissions and network 
resources are sufficient for the highly reliable communications. On the other hand, 
the scheduling algorithm cannot assign as many resources as possible to all packets 
because wireless network resources are limited. Therefore, in order to make 5G NR 
meet the real-time and reliability requirements of industrial systems, the scheduling 
problem of 5G NR needs to be studied. 

5G NR supports two-dimensional (2D) time-frequency resources. Since the 2D 
resources are different from other systems, some researchers have begun to study the 
new scheduling problems for 5G. The work in [4] proves the NP hardness of the new 
problem and proposes an algorithm based on Lagrangian duality to guarantee the 
real-time performance of as many services as possible. The work in [5] aims at the 
same objective and proposes two heuristic algorithms to generate schedules quickly. 
The work in [6] applies machine learning to improve real-time performance and data 
rate. Similarly, the work in [7], based on machine learning, proposes an energy- 
efficient real-time scheduling algorithm. To make the proposed algorithms usable in 
actual systems, some real factors have been considered. The work in [8] considers 
the on-off operation of power amplifiers in the scheduling problem and proposes 
a sliding window-based algorithm to optimize the real-time performance and the 
energy efficiency for service transmissions. The work in [9] focuses on the impact of 
interference and channel estimation error on data rate. The work in [10, 11] studies 
how to guarantee the delay requirement under the minimum bandwidth. Although 
the scheduling problem of 5G NR has been studied more and more widely, the 
mixed-criticality scheduling problem of 5G NR has not been considered. 

In industrial systems, control commands are the most important and must be 
delivered to devices in time, while some less important data, such as system logs and 
routine monitoring, can be delayed. Hence, under the limitation of 5G NR, the best 
way to schedule packets is to assign more resources to important packets, and allow 
the other packets to use the rest of the resources and the idle resources that have been 
assigned to the finished important packets. In other words, when the resources of 5G 
NR are insufficient, unimportant packets have to be discarded first. This process is 
typical of mixed-criticality scheduling. Some novel algorithms have been proposed 
to address the mixed-criticality scheduling problems of networks [12-17]. However, 
these algorithms cannot be used in the 5G NR model. 

In this chapter, we introduce mixed criticality to 5G NR and propose a mixed- 
criticality scheduling algorithm to improve the real-time performance and reliability 
of industrial 5G networks. Much research has focused on mixed-criticality schedul- 
ing algorithms. However, the two main characteristics of our problem are not 
considered in other research. 
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1. On 5G NR, the available resources include time slots and frequency bandwidth, 
while in other mixed-criticality systems, the resources are time slots and 
processors. Compared to processors, frequency bandwidth is finer-grained and 
can be divided and converged. Although this characteristic contributes to the 
flexibility of scheduling algorithms, it also makes scheduling algorithms more 
complicated and more difficult to find optimized solutions. 

2. In related work about mixed-criticality networks, e.g., [18, 19], when high- 
criticality packets are transmitting, all the low-criticality packets have to be 
discarded. However, our scheduling algorithm tries to guarantee the performance 
of all the high- and low-criticality packets. This difference makes us have no 
related work to refer to. 


To solve the mixed-criticality scheduling problem of 5G NR, this chapter 
includes the following: 


1. First, to rigorously state the problem, we propose a specification based on 
optimization modulo theories (OMT). The problem can be reduced to the bin 
packing problem. Therefore, our problem is at least NP-hard. Based on the 
specification, some off-the-shelf solvers can find optimal solutions. 

2. Second, to improve the scalability of our work, we propose a heuristic, 
pseudolevel-packing algorithm to assign dedicated and shared resources to 
packets. In the algorithm, we extend real-time constraints and mixed criticality 
to the traditional bin packing problem, and analyze the sufficient condition and 
necessary condition for schedulability so that the solution space can be reduced 
effectively. 

3. Third, we implement an industrial 5G testbed and evaluate our proposed 
algorithms. To compare all the algorithms under the same signal quality, we 
record signal qualities into trace files. Then, we conduct simulations based on the 
trace files and extensive test cases. The results indicate that our algorithm makes 
communication reliability greater than 99.9% on unlicensed spectrum, and for 
most test cases, our proposed algorithm is close to optimal solutions. 


6.2 Problem Statement 


The symbols used in this chapter are summarized in Table 6.1. 

We consider the scheduling problem under one base station and many users. A 
data flow is from a user to the base station (or from the base station to a user). In 
the following, we ignore the direction of flows because flows in different directions 
have the same resource requirement. In the flow set F, all flows have the same 
period P. Each flow f; generates a packet at time j x P (j € Z), and the packet 
must be delivered to its destination before its absolute deadline (j + 1) x P. Since 
the packets contained in different periods have the same resource requirements, we 
only consider how to schedule packets in the first period P. After the first period, 
the subsequent schedules are periodically repeated. 
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Table 6.1 Symbols 


Symbol Definition 

P Period 

F Flow set 

fi The i-th flow 

Xi The highest criticality level of f; 

Ci Transmission time durations 

li Frequency bandwidth 

X The highest criticality level of the network 

F* Set of flows with x; = e 

Tj Packet generated by f; 

cl Transmission time duration of r; at criticality level j 
Xj Start time of t; 

yi Start frequency of r; 

F Packet set 

L Bandwidth of the available resources 

qi If v; is covered, q; = 1. 

w Weight of the e-th criticality level 

ps Set of packets generated by the flows in F° 

fi If t; and v; cover or overlap each other, then r; ; = 1. 
I’ Ordered set of packets 

T The i-th packet in I" 

[A Frequency bandwidth of t’ 

ce Transmission time duration of t’ at criticality level e 
h Finish time of all placed packets 

aly]I[x] If a[y][x] — 1, the corresponding resource is occupied 
y The last row above row y 

Bl, B?, p? Three types of packets 

clc Transmission durations of B!,B? and B?, respectively 
11,1, P Frequency bandwidths of B!,B? and B?, respectively 
nl, n?, n? Number of local levels with lengths of cl (c? or e?) 
R Resources actually occupied by all packets 

E Resources required by Algorithm 6.1 

ôj Metric that indicates the difference between r^ and t/ 
T; Packet set returned by Cover (k) 

N Number of packets 


Flow f; is characterized by a three-tuple < x;, Ci, l; >, which denotes its highest 
criticality level, transmission time durations and frequency bandwidth, respectively. 
The highest criticality level of our network is set to X, and for each fj, 1 < x; < X. 
We use F* to denote the set of the flows with x; = e. The packet generated by f; 
is tj. C; = iet e^. T gj is the set of transmission time durations of t;. The flow 
with larger x; can occupy more time slots. At criticality level 1, v; is transmitted 
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Fig. 6.1 Flow model. (a) (i.c) at criticality level 1. (b) v; at multiple criticality levels 
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Fig. 6.2 Two packets overlap each other 


once in one time slot, and at criticality level j, v; is transmitted j times in j time 


slots, i.e., c] —jx cl; 5G numerology defines three subcarrier spacings (180, 
360 and 720 KHz) and three corresponding slot lengths (1000, 500 and 250 us). We 
define 15 kHz and 250 ps as the unit bandwidth and the unit slot length, respectively. 
Then, (/j, cl) € ((1, 4), (2, 2), (4, 1)} (as shown in Fig. 6.1a). We use x; and y; 
to denote the start time and start frequency of the transmission of r;, respectively. 
The illustration is shown in Fig. 6.1b. Initially, at the lowest criticality level, v; is 
transmitted once in time duration B. If vj is not sent successfully in the duration, 
then its critically level is set to 2. At criticality level 2, the transmission time 
duration is increased to e and rj is transmitted again. Repeat this process until the 
transmission is successful. If the transmission still fails in the longest transmission 
time duration e , tj has to be discarded. 

For any two packets t; and tj, the resources assigned to them are not allowed to 
overlap each other (Definition 6.1). An example is shown in Fig. 6.2. At criticality 
level 2, the two packets share resources. Then, rj may be preempted by r;. At the 
same criticality level, the packets are equally important and cannot preempt each 
other. Therefore, in feasible solutions, packet overlapping is not allowed. 
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Fig. 6.3 rj covers Tj criticality (etc ys) 


Ti 
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Fig. 6.4 Sufficient resources criticality 
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Definition 6.1 1; and t; overlap each other (as shown in Fig. 6.2), if the following 
conditions are all met. 


1. 


On the x axis (time dimension), t; and rj overlap, i.e., xj € xj < xi + p 
or xi € xj + en ! < xi + p P 2 We only check the highest criticality 


level occupied by both of them, i.e., min{x;, Xj}. At the other lower criticality 
levels, they occupy fewer time slots. Even if they do not overlap at the lower 
criticality levels, they may overlap at criticality level min{x;, x ;}. 


. On the y axis (frequency dimension), v; and v; overlap, ie. y; € yj < yi + li, 


or yi € yj t lj € yi li. 


If t; covers v; (Definition 6.2), t; may not be sent. As shown in Fig. 6.3, when 


tj is being transmitted at criticality level 3, the resources assigned to v; are being 
occupied by z;. Then, v; has to be discarded. 


Definition 6.2 t; covers v; (as shown in Fig. 6.3), if the following conditions are 
all met. 


1. 
. The highest criticality level of v; is larger than that of v;, i.e., x; > Xj. 
. On the x axis (time dimension), t; and v; share the same time slots at the different 


t; and v; do not overlap each other (Definition 6.1). 


criticality levels, i.e., x; € xj < x; + e orx; € Xj c T « xj t d 


. On the y axis (frequency dimension), z; and v; share the same frequency 


bandwidth, i.e., y; € yj € yi - li; or yi € yj lj € yi t li. 


Our objective is to send as many packets as possible. Once a packet is covered, 


it may be discarded. If there are sufficient resources, no packets can be covered. For 
example, x; is set to x; + e (as shown in Fig. 6.4). Then, v; and v; do not cover or 
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overlap each other. No matter which criticality level v; is at, v; is not affected. Only 
when resources are insufficient, v; is allowed to cover tj. 

We formulate our problem as an OMT specification [20]. OMT is an extension 
of satisfiability modulo theories (SMT), which has been widely used to determine 
whether a specification is satisfiable or not. In addition to OMT supporting all 
the operators of SMT, it can also find an optimal objective. Our problem is to 
send as many packets as possible under scheduling constraints. Therefore, OMT 
is the best choice for our problem. The solution found by OMT not only satisfies 
scheduling constraints but also maximizes the number of packets sent. In the 
following specification, ^, v and — denote the logical operations of conjunction, 
disjunction and negation, respectively. 

Each flow in F generates one packet. All the packets are included in the packet 
set I’. These packets are transmitted in resources including a bandwidth of L and a 
time duration of P. The problem is how to determine (xj, yj) for each packet such 
that as many packets as possible are transmitted, and high-criticality packets are not 
covered as much as possible. Therefore, the objective is to minimize the weighted 
sum of the number of covered packets, as follows: 


min » (o x > qi), (6.1) 


vee[1, X] Were 


where qi = 1 indicates that v; is covered by a higher-criticality packet, c is the 
weight of the e-th criticality level, and I'^ includes the packets with x; = e, i.e., 
T°’ = (u|Vf; € F, xi = e}. In order to ensure that high-criticality packets are 
not covered as much as possible, we set that œ! = ] and Ve € [2, X], o* = 
2 Veelt,e—11 wE x |[8| + 1, i.e., œ is greater than the weighted sum of all lower- 
criticality packets. Hence, to minimize the objective, when there are not sufficient 
resources, low-criticality packets are covered first. 
To check if t; and vj cover or overlap each other, we define the following function 
Disjoint(i, ci, j, cj) 2(xi = xj +cj) V OF = yjclj) (62) 
V(x;jzxictc)v(yjzyicli). 


The function only considers time-frequency resources, and criticality levels are 
reflected in c; and cj. 
We use r; j to bridge q; and Disjoint(). Vf; € F, Vf; € (FXi*!,..., FX), 


(rij == 1) ^ —Disjoint (i, c% , j, c?) 
| : (63) 
V ((ri,j == 0) ^ Disjoint(i, c™, j, oy); 


t 


1> qi ÈZ fij = 0. (6.4) 
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In Eq. (6.3), if v; and v; cover or overlap each other, r;,; is equal to 1. However, 

the following constraint 2) can avoid overlapping between two packets. Thus, only 

when r; covers Ti, 7;,; = 1. Then, in Eq. (6.4), if there exists r; į = 1, then q; = 1. 
The minimizing problem has to respect the following constraints. 


1. Range constraint: The ranges of variables used in the specification are as follows. 
Vf; € FO < xj < P—c*,0<y; -«L-lj, 
qi € [0, 1], r; j € {0, 1]. 


(6.5) 


2. Overlap constraint: Any two packets are not allowed to overlap each other. Note 
that in Eq. (6.6) the transmission time duration of v; is at the highest criticality 
level of v; because the overlap of two packets only occurs at the same level. 


V fi € F, Yr; € (T*|Ve € [xi, X]}, Disjoint (i, e. jc) = true. (6.6) 


A packet set is called schedulable if it has a feasible placement that meets all 
the constraints. When the objective is restricted to 0, the above problem is how 
to place all the packets into the rectangular area with dimensions L x P. This is 
the same as the 2D bin packing problem, in which a set of rectangular items is 
packed into a 2D rectangular bin. The NP-hardness of the 2D bin packing problem 
has been proven [21]. Since our problem can be reduced to the 2D bin packing 
problem, it is at least NP-hard. Hence, there is no polynomial time algorithm for 
finding an optimal solution. Although the specification (Eq. (6.1)-(6.6)) can be 
solved by OMT solvers, e.g. Z3, for complicated systems, the execution time of 
solvers cannot be acceptable. Therefore, in the following section, we will propose a 
heuristic algorithm to schedule packets. 


6.3 Scheduling Algorithm 


Firstly, we introduce a basic scheduling algorithm that does not consider the time 
constraint P and does not support any packet being covered. Secondly, based on 
the basic scheduling algorithm, we analyze the sufficient condition and necessary 
condition for schedulability. Finally, we extend the basic algorithm based on the 
two conditions to support time constraints and packet covering. 


6.3.1 Basic Scheduling Algorithm 


The basic scheduling algorithm is a pseudolevel-packing algorithm (as shown in 
Algorithm 6.1). Fig.6.5 shows an illustration. There are two types of packing 
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Searching order a[ylIx] 


frequency(y) 


Fig. 6.5 Illustration of Algorithm 6.1 


levels: super level and local level. A super level consists of multiple local levels. 
A local level is similar to the level that is widely used in multi-level bin packing 
algorithms, and its length is determined by the first packet placed at this local level. 
Algorithm 6.1 processes packets from the highest criticality level to the lowest 
criticality level (line 2). The packets with the same criticality level are placed at 
the same super level (lines 3-21). After all the packets in one criticality level are 
finished, a new super level is created to hold the packets in the next criticality 
level (lines 18-20). At each criticality level, the algorithm, first, sorts the packets 
according to the decreasing order of their transmission time durations (line 3), 
and then places packets in the same order (line 5). The single quotation mark on 
the symbols indicates that the symbols are sorted. For example, a sorted packet is 
denoted by 1, and its frequency bandwidth and transmission time duration are /; 
and c, respectively. h is the finish time of all placed packets, and array a[ ][ ] 
is used to indicate which resources are occupied. If a[y][x] = 1, the resource at 
time slot x and on frequency y is occupied; otherwise, the resource is available. 
At each local level, the algorithm searches resources first in the order of time slots 
and then in the order of frequency (lines 6-15). If an available resource is found, 
i.e., a[y][x] = 0, the upper left corner of T is placed at coordinates (x, y) (line 
16). The algorithm does not need to check all the resources that will be occupied 
by the rectangle of t/ because according to the above searching order, the other 
resources must be available as long as a[y][x] is available (Theorem 6.1). Then, 
based on the resources requested by p , a[ ][ ] is updated (line 17). If there is not 
sufficient resource in the current local level, a new local level is created (lines 7 and 
8). Repeat this process until all packets are placed. Finally, the locations of all the 
packets and the finish time are returned to the calling function. The time complexity 
of the algorithm is O(|l'|LA) because the algorithm traverses all the packets of T 
(lines 2 and 5) and all the resources of L x h (line 6). Since the basic scheduling 
does not support any packet being covered, the objective value is not considered in 
Algorithm 6.1. 
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Algorithm 6.1 Basic Scheduling Algorithm BasicSch(T) 
Input: I 
Output: V(x;, yj) and h 
1: h = 0; a[]L] = {0}; 
2: for each e = X to 1 do 
3: sort the packets of I'^ according to the decreasing order of their c7, where the first packet 
ti has the largest c% ; 
4 x=h;y=0;g=c); 
5:  foreachi = 1 to |T^| do 
6: while (a[y][x] == 1) or (y + /; > L) do 
T: if (y + I; > L) then 
8: h-x-hctgiy-cO0g-c5 
9: else 


10: x=x4+1; 

11: if (x > h + g) then 
12: x=h;y=y+l1; 
13: end if 

14: end if 

15: end while 

16: Xp—Xx»x-y 

17: Vj € [y, y +1), Vk € [x x + cf), aljMk] = 1; 
18: if (i == |P°|) then 

19: h=h+g8g; 

20: end if 

21: end for 

22: end for 


23: return V(x;, y;) and h; 


Theorem 6.1 In the process of placing v;, if a[y][x] = 0 and y + l; < L, then 
Vj € [0, 1), Vk € [0, c7), aly + jllx +k] = 0. 


Proof Assuming that 3$ € (y, y + D), 3$ € (x, x + gy. a[5][X] = 1. Recall that 
5G numerology defines only three subcarrier spacings, 15 KHz, 30 KHz, and 60 kHz. 
Thus, at a super level, there are three types of packets. We use B!, B? and B? to 
denote them, and their widths and lengths are B! = (l L c!), B? = (Ê, c?), and 
B? = (P, c?), respectively. Based on the definition of three subcarrier spacings, we 
know that 4 x 1! = 2 x ? 2 P and c! = 2 x e = 4 x œ. Since y + L < L, 
the frequency resource is sufficient. Hence, we do not need to discuss the frequency 
dimension. We use row y to denote the last row above row y and occupied by some 
packets. In the following, we discuss the three cases of row y. 


(1) There is no row y, and row y is the first row. However, in Algorithm 6.1, the 
first row of a local level is fully occupied because only when the first packet is 
placed is a new local level with the same length as the packet created. Therefore, 
the unoccupied a[y][x] is not at the first row, and row y must exist. 

(2) Row y is occupied by the same type of packets. The same type of packets 
cannot be two B! because each local level contains at most one longest packet. 
If a[y][x] = 0 and a[$][] = 1, then the placement is shown in Fig. 6.6a. 
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Fig. 6.6 Illustration of Theorem 6.1. (a) Illustration of Case (2). (b) Illustration of Case (3) 


However, according to the searching order, the upper-left corner of the right 
packet should be at the point marked with a cross. Then, a[][X] is not occupied, 
i.e., a[ĵ] [$] = 0. 

(3) Row y is occupied by different types of packets. The different types of 
packets must be B? and B?. Since in Algorithm 6.1, the packets are sorted 
according to the decreasing order of their transmission time durations, the 
current packet t; must be B 3. The illustration is shown in Fig. 6.6b. We know 
that c? = 2 x c?. Thus, two whole B? can be placed below B?. There is no other 
packet with different transmission time durations in this problem. Therefore, the 
packet that is not aligned with B? does not exist, and a[4][£] is not occupied, 
i.e., aLy][X] = 0. 


To sum up, a[y][x] is not consistent with its definition. The above assumption 
does not hold. Therefore, V? € (y, y + l), VX € (x, x + e. a[y][X] = 0. 
Oo 


6.3.2 Analysis 


We, first, analyze the networks with only one criticality level, and then extend to 
multiple criticality levels. To simplify the description, when only one criticality level 
is considered, we ignore the symbols about criticality levels. In the networks with 
only one criticality level, the lengths of local levels can be cl, c? or P, and L > P. 
Assuming that in the result of Algorithm 6.1, there are nl, n?, and n? local levels 
with lengths of cl, c, and c3, respectively, and n! ne, n? eNA {0}. 

When only one criticality level is considered, Algorithm 6.1 has the following 
two properties. 


Property 6.1 Ifn! +n? +n? = 1, and L approaches infinity, the resource utilization 
can be infinitesimal. 
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For example, there is only one packet in the network. Then, the resource 


utilization is pai , which decreases as L increases. 
t 


Property 6.2 V£ n! +n? +n? > 1, the resource utilization must be greater than 1. 


Proof 'To calculate the resource utilization, we need to analyze the amount of 
resources required by Algorithm 6.1 and the amount of resources actually occupied. 
The resources required by Algorithm 6.1 is E = L(nlc! 4-n?c?-- n? ?). The amount 
of resources actually occupied is analyzed in each of the following cases: 


(1) For the local levels with a length of c!, since the width of B! is the unit width, 
in the first n! — 1 local levels all the resources must be occupied. At the n!-th 
local level, if n? = n? = 0, atleast one Bl is placed, i.e., only 1! x c! resources 
are occupied; if n? Æ 0, at most (/* — 1) x c! resources are idle because /* x c! 
resources are sufficient for the subsequent B?; similarly, if n? = 0 and n? Æ 0, 
then (I? — 1) x c! resources are idle. Thus, in the worst case, the amount of 


resources actually occupied in the n! local levels is 


Lcl'(n! — 1) + i!cl, ifn? 2 5 20 (R11) 
Ri = 4 Lcln! — (D — Del, ifn? 20,n? 40 (R15) 
Lclnl — (I? — 1)cl, others. (R13) 
(2) For the local levels with a length of c?, in the first n? — 1 local levels, at most 
(L mod i?)c? (n? — 1) resources are idle. This is because after placing several 
B?, the last (L mod 1?) rows are not enough for the next B?. At the n2-th 
local level, if n? = 0, at least /2c? resources are occupied; otherwise, min{ (L — 
12), (P — 1)) c? resources are idle because if L is very short, after placing B?,the 
remaining resources may be less than (/? — 1)c?. Thus, the amount of resources 
actually occupied in the n? local levels is 


— 


Le?(n* —1)—(L mod I”) xc?(n?— D Pc, ifn? 20 (R21) 
Ro = 1 Lc?(n? — 1) — (L mod 2) x (n? — 1) + Lc? 
— min((L — 17), (P — D) x c?, others. (R22) 


(3) For the local levels with a length of e, R3 is similar to Ro, i.e., 
R3 = Le (n? 2 1) - (L mod D) x e? — 1) + Pe. 


Thus, the total amount of resources actually occupied is R = Rj + R2 + R3. 
Table 6.2 shows R and the lower bounds of utilization R under different cases. The 
lowest bound in Table 6.2 is 1. Therefore, the resource utilization must be greater 
than 1. o 


Then, based on Properties 6.1 and 6.2, we analyze the sufficient condition for a 
packet set to be schedulable by Algorithm 6.1. Theorems 6.2 and 6.3 are about one 
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Table 6.2 Lower bound of resource utilization 


nl, n2, i? R = Ri + Ra + R3 Lower bound of £ 

0,0,- R3 2 > i (when m=2,L= +oo) 

0,-,0 R54 2 > i (when n? = 2, L = +00) 

0,-,- R3 + R3 2 > 4 (when L=7,R =w = 1) 

-,0,0 Ria 2 > i (when n! = 2, L = +00) 

-,0,- Rio + R3 i > 7 (when L = 7, n! =n? al) 

-,-,0 Ria R21 E i (when n! =n? = 1) 

-—- R13 + R22 + R5 g > 7 (when L — 7, n! =n? =1,n3 Foo) 


criticality level and multiple criticality levels, respectively. In addition, the necessary 
condition for a packet set to be schedulable is shown in Theorem 6.4. 


Theorem 6.2 For a packet set with only one criticality level, Algorithm 6.1 can find 


: i à . 2 wer lixci 
a feasible placement under the time constraint P, if max{maxy;, er {Ci}, ———] 


x P. 


Proof First, we discuss the placements with at least two local levels, i.e., n! + 
n? +n? > 1. In the proof of Property 6.2, R is the lower bound of resources actually 
occupied. Thus, the actual occupied resources a er li X cj is greater than or equal 
to R, i.e., 2 vrer li, xcq>R> iE ES iL(c + n?c? + n?c?). Rewriting, we 
obtain 


25. li x ci 
Yrer“ L 
pos > nicl + nic? + n3c3. 


If < P, then the actual length nic! + n?c? + n3c3 must be less than 
P, i.e., the placement is feasible under the time constraint P. 


2 Dvr er li Xci 
L 


2Y ser li xci 
Then, from Property 6.1, we know that when n! +n? +n? = 1, DUEB 


be infinitesimal. However, the length of a local level is equal to the longest length 
of all the packets, i.e., maxy; er {ci}. Therefore, combining the above two cases, the 


. n . 2» ver li xci 
sufficient condition is max(maxvz er(ci], ——;——} x P. oO 


Theorem 6.3 (Sufficient Condition) For a packet set with multiple criticality 
levels, Algorithm 6.1 can find a feasible placement under the time constraint P, 


if 
2 vuere li X 6j 


> (max{ max {cf}, ———————}) <P. (6.7) 
Viele L 
Vee[1, X] 
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Proof Since Algorithm 6.1 places super levels one by one, the total length is the 
sum of the lengths of all super levels. If the total length satisfies the time constraint 
P, the placement is feasible. o 


Theorem 6.4 (Necessary Condition) For a packet set with multiple criticality 
levels, if Algorithm 6.1 find a feasible placement for a packet set under the time 
constraint P, then the packet set satisfies the following condition: 


X du <LxP. (6.8) 
Vier 


Proof Note that in Algorithm 6.1 high-criticality packets are not allowed to cover 
low-criticality packets. Thus, if a packet set is schedulable, its resource utilization 


r D ovr;er li xe" 
cannot be greater than 100%, i.e., ——7—5—— < 100%. oO 


6.3.3 Improved Algorithm 


In this subsection, we design our scheduling algorithm based on Theorems 6.3 
and 6.4. The objective of our problem is to cover as few packets as possible, and 
low-criticality packets are covered first. Hence, we sort the packets of F according 
to the following rules: Vr;, tj € I, 


e if x; > Xj, then 7; is before v; in the ordered set rE 
* if x; = xj and c” > R then 7; is before v; in the ordered set I’; 


J 


e if xi = xj, e = e andi < j, then rj is before v; in the ordered set I". 


To optimize the objective, an effective method should cover the packets of I" from 
back to front. First, no packet can be covered, and the current packet set is checked 
whether it is schedulable or not under the time constraint P. If the packet set is 
not schedulable, the last packet tp, is covered. Then, if the new packet set is 
still unschedulable, the packets vjr/j-; and tr; are covered. Repeat this process 
until a schedulable packet set is found. However, this method may traverse all the 
packets. To improve the efficiency of our algorithm, we reduce the solution space 
based on Theorems 6.3 and 6.4, and adopt binary search instead of linear search. In 
Theorem 6.5, we proved that the solutions corresponding to the sorted packets are 
also ordered. Thus, the binary search can be used in our scheduling problem. 

Our scheduling algorithm is shown in Algorithm 6.2. The variable k is used 
to denote that the packets after the k-th packet can be covered, and the function 
Cover (k) (Algorithm 6.3) is to determine which packets are selected to cover these 
packets. In Cover (k), for each packet p (i > k), the metric 5; indicates the area 
difference between Tj and t; in the time-frequency coordinate (line 8). If tj cannot 
fully cover Uu , it is not an available selection (line 4). This is because if a small 
packet covers a big one, then the overflowing part of the big packet will change 
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Algorithm 6.2 Scheduling Algorithm with packet Covering (SAC) 
Input: I", X, L, P 
Output: V(x;, yi) 
1: k= |r]; 
2: do 
3: T; = Cover(k);k =k- 1; 
4: while T; does not satisfy Eq. (6.8); 
5: right =k+ l; 
6: do 
7: Ty = Cover(k); k =k- 1; 
8: while k > 0 and T; does not satisfy Eq. (6.7); 
9: left =k + 1; middle = | e£crieh |; 
10: while left Z middle do 
11: BasicSch(T, idate) returns V(Xmiddle,i, Ymiddle,i) and h; 
12: ifh < P then 


13: left = middle; 
14: else 

15: right = middle; 
16: end if 


17: middle = | Ieftirish |: 
18: end while 

19: if left = 0 then 

20: return FALSE; 

21: end if 

22: return V(Xieft,i, Yleft,i): 


the shape of the small packet, and the schedulability will become worse. Then, to 
reduce the waste of resources, the packet with the minimal 4; is selected to cover 
t; (lines 14-15), and the covered packet t/ is marked in T” (line 16). If none of 
the packets have a valid ĉj, t/ cannot be covered even though it is allowed to be 
covered. Finally, Cover (k) returns I’ to Algorithm 6.2, called as Ty. 

Algorithm 6.2 traverses k from |I"| to 1 (lines 1, 3 and 7). The first k that 
makes T; satisfy the necessary condition is the rightmost element of the binary 
search (lines 2-5), and then similarly, the first k that makes I’, satisfy the sufficient 
condition is the leftmost element (lines 6-9). Then, in the binary search, for each 
middle element, BasicSch(T,,;,,) is invoked to place the packet set and returns 
the finish time. If the finish time is not greater than time P, a better solution may 
be between middle and right; otherwise, a feasible solution is between left and 
middle (lines 12-17). The leftmost element is always a feasible solution unless no 
packet set satisfies the sufficient condition. Thus, only when the leftmost element 
is 0, and the middle element does not search for any feasible solution, cannot 
be scheduled (lines 19 and 20). Otherwise, the placement under the latest leftmost 
element is the final solution (line 22). In Algorithm 6.3, the number of iterations 
in lines 1, 2, 6 and 15 is O(|I'’|), O(|T'|), O(X) and O(X), respectively. Thus, 
the time complexity of Algorithm 6.3 is O (|V'|? X). In Algorithm 6.2, the number 
of invoking Cover( ) is O(|I'|) in the worst case. Hence, the time complexity of 
lines 1—9 is O (|P]? X). Then, in lines 10-18, the number of invoking BasicSch( ) 
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Algorithm 6.3 Covering Algorithm Cover (k) 
Input: I", k 
Output: I", and obj 


1: for each i = |I| tok do 

2 for each j = i — 1 to 1 do 

3 if Xj < xj or li < l; then 

4: ôj = +00; 

5: else 

6: for each g = x; + 1 to Xj do 

7 ifc? > c% cj then 

8: sj = Gi — e xl — c" x l5 
9: break; 
10: end if 
11: end for 
12: end if 
13: end for 
14: if 3c, 0, # +00 and At), ôm < 6, then 
15: Vg e [1, x], of =o x cf 
16: 1; is marked as covered in I"; 
17: end if 
18: end for 


19: calculate obj based on Eq. (6.1); 
20: return T” and obj; 


is O(log |I"), and its time complexity is O(log |I"| - |V'| LA). Therefore, the time 
complexity of Algorithm 6.2 is O (n^). 


Theorem 6.5 [fT can be placed before time P, so can V, ,, and Obj(V, )) = 
Obj (Vj); if V, cannot be placed before time P, neither can V, , 4, and Obj (T) = 
Obj (V, , 4), where Obj (T) is the objective value of packet set V; calculated based 


on Eq. (6.1). 


Proof First, we prove that for our problem if T; has a feasible solution, then I; , 
also has a feasible solution, and the objective value of I7, , is not greater than that 
of I';. The difference between I”, and D; , is that t; may be covered in T ,, while 
it cannot be covered in I’. There are two cases: 


1. When rj is covered in I’, we discuss all types of tj as follows: if v; is Bl, in 
the same local level, the places of the subsequent packets are moved from (x, y) 
to (x, y — l1); if t is B?, in the same local level, the places of the subsequent 
packets are moved from (x, y) to (x, y — P), œ- e, y) or (x + c?, y— 1); if 
T; is B?, in the same local level, the places of the subsequent packets are moved 
from (x, y) to (x — c?, y) or (x + (the length of the local level) — 1, y — /?). In 
these cases, the subsequent packets are moved up and left in the same local level. 
Therefore, the finish time of i , is not later than that of r}. In addition, based 


on Eq. (6.1), if t is covered, Obj (T; ,) = Obj T) + e*t, 
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2. When 1; is not covered in D; ,, r} = I; ,, ie., they have the same solution, 
and Obj(I'_,) = Obj (T). 


Therefore, if I, can be placed before time P, so can I; ,, and Obj(T, j) = 
Obj (T). 

Second, for I, and Des S lf can be covered in I';, but cannot be covered 
in T; 44: Packet Tj ,1 introduces more delay into the solution of T, 44: Hence, if 
the finish time of T; is greater than P, I; , cannot finish before time P. For the 
objective value, if t;  , is coveredinI,, Obj (T;) = Obj (Py) tox ; otherwise, 
Obj(T,) = Obj (T, +1). Therefore, if T; cannot be placed before time P, neither 
can I”, ,, and Obj (T;) = Obj(T, Oo 


k--r +1) 


6.4 Performance Evaluations 


In this section, we will evaluate our proposed algorithms based on a 5G testbed 
and extensive test cases. Five metrics are used in our evaluation: (1) packet loss 
ratio (PLR) is the ratio of the number of lost packets to the total number of sent 
packets; (2) schedulable ratio is the percentage of test cases for which an algorithm 
can find a feasible solution; (3) objective value is calculated based on Eq. (6.1); 
(4) execution time is the time required to find an optimized solution; and (5) the 
number of calls to BasicSch( ) reflects the effectiveness of Theorems 6.3 and 6.4. 
Our proposed algorithm SAC is compared with the following methods: 


1. OMT adopts the Microsoft solver Z3 [22] to solve our OMT specification 
(Eq. (6.1)—(6.6)). Although the solver can find the optimal solution, its execution 
time is unacceptable when the problem is complex. Thus, OMT only appears in 
simple test cases. 

2. T4 corresponds to the necessary condition of Theorem 6.4. We need an excellent 
baseline to illustrate the effectiveness of our algorithm. However, there is no 
method to find optimal solutions for complicated test cases. Therefore, when Z3 
cannot find optimal solutions in an acceptable time, we replace OMT with T4. 
In T4, if a test case satisfies Theorem 6.4, it is considered schedulable. Hence, 
for schedulable ratios, T4 is better than optimal solutions when packets do not 
cover each other. For the objective of the scheduling problem, if in an ordered 
set I" the first k packets satisfy Theorem 6.4, we assume that the packets after 
these k packets are covered. Thus, the objective value of T4 is always better than 
optimal solutions. Note that T4 is an analysis-based method and cannot generate 
schedules. Hence, we do not consider its execution time and PLR. 

3. FFDH (First-fit decreasing-height) [23] is a classical strip packing algorithm. 
In this chapter, FFDH sorts packets by order of non-increasing length, and then 
scans the local levels from left to right. Each packet is placed in the first level 
that has sufficient resources. FFDH does not support packet covering. Hence, 
we cannot calculate the original objective value of FFDH. The original objective 
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Table 6.3 Parameters Number of packets 


Number of criticality levels 
Number of resources in the frequency dimension 


W/O) = 


Number of time slots in the time dimension 


value is about the covered packets. In FFDH, this kind of packets includes those 
that cannot be placed before P. Therefore, when some packets cannot be placed 
before P, they are used to calculate the objective value based on Eq. (6.1). 

4. SACwoT3T4 is the same as SAC except that it does not adopt Theorems 6.3 
and 6.4 to reduce the solution space. 


All algorithms are written in C and run on a Windows workstation with a 3.7 GHz 
CPU and 64GB memory. The parameters used in the evaluation are summarized in 
Table 6.3. In a test case, N packets are transmitted in a 3D space of dimensions X x 
L x P. For each packet, its criticality level is randomly selected in the range [1, X], 
and the transmission time duration at the lowest criticality level and bandwidth are 
randomly selected in ((1, 4), (2, 2), (4, 1)]. 


6.4.1 Evaluations Based on A Real Testbed 


In this subsection, we evaluate packet loss ratios of different algorithms. The other 
metrics are not affected by the signal quality, and are more suitable for being tested 
through extensive test cases, which are shown in the next subsection. Our 5G testbed 
is shown in Fig. 6.7. It operates in the 2.4 GHz unlicensed ISM (industrial scientific 
and medical) band. The licensed spectrum is managed by mobile network operators, 
and developers and users cannot modify any strategy. Therefore, we adopt the 
unlicensed spectrum. If the strategies on the licensed spectrum are allowed to be 
customized, our proposed algorithms can be used without modification. Two 5G 
devices [24] are configured as a base station and a user, respectively. Since the 
signal quality is dynamic and unknown, to guarantee fairness for all algorithms, we 
trace the states of 7 subcarriers and 60,000 time slots, and then conduct trace-driven 
simulations. 

First, in small networks, we compare SAC and OMT (as shown in Fig. 6.8). The 
parameter setting is < N, X, L,P >=< 10,4, 7,20 >, and CLx denotes the 
PLR at criticality level x. Since high-criticality packets can be transmitted more 
times, CLA is the lowest. Both OMT and SAC successfully assign resources to the 
packets at criticality level 4. The average of CL4 in OMT and SAC are 0.07296 and 
0.067%, respectively. The slight difference between them is caused by the dynamics 
of signal quality. The average of CL1 in OMT and SAC are 16.7% and 17.0%, 
respectively. In SAC, since some low-criticality packets share resources with high- 
criticality packets, the CL1 of SAC is lower than that of OMT. 
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Fig. 6.7 5G testbed 
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Fig. 6.8 Packet loss ratios in simple networks. (a) OMT. (b) SAC 


Second, since OMT cannot solve complex problems in an acceptable time, 
we compare only SAC and FFDH in large networks (as shown in Fig. 6.9). The 
parameter setting is « 80, 4, 7, 80 >. Since the control period of the robotic arms in 
our testbed is 20 ms, and our 5G device supports 4 time slots in 1 ms, the parameter 
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Fig. 6.9 Packet loss ratios in complicated networks. (a) SAC. (b) FFDH 


P is set to 80. FFDH reserves dedicated resources for packets. When high-criticality 
packets do not need to be retransmitted, the assigned resources are wasted, and low- 
criticality packets cannot obtain sufficient resources. Hence, the average of CL1 is 
about 70.9%. In addition, FFDH does not consider criticality levels so that some 
high-criticality packets are discarded. Therefore, FFDH has higher PLRs even at 
the highest criticality level. The average of CL4 in FFDH is about 17.296. SAC 
makes the best of resources to guarantee the requirements of packets. Hence, the 
average of CL4 in SAC is 0.071%, and for CL1, when the PLR of FFDH is 70.9%, 
SAC improves PLR to 42.7%. In SAC, the PLR of the highest criticality level is 
still about 0.07%, no matter which networks SAC is used in. Thus, SAC makes 
communication reliability greater than 99.9% under real-time constraints. 
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Fig. 6.10 Comparison with OMT. (a) Schedulable ratio. (b) Objective value 


6.4.2 Evaluations Based on Random Test Cases 


In the following, we will randomly generate extensive test cases to comprehensively 
evaluate the performance of our proposed algorithms. To make Z3 solvable, the 
parameter settings are < [10, 18], 4, 7, 20 >, and 12 processes run in parallel on our 
workstation. For each parameter setting, 200 test cases are randomly generated. The 
results are shown in Fig. 6.10. The objective value is the average of 200 test cases. 
As N increases, the schedulable ratios decrease, and the objective value increases 
because it is hard to find optimized solutions when more and more packets are 
transmitted on limited resources. The worst algorithm is FFDH. Since in FFDH, no 
packet can be covered, the limited resources are not sufficient to transmit all packets. 
OMT has the highest schedulable ratio and the lowest objective value. However, the 
execution time of OMT fluctuates greatly. For some simple test cases, Z3 cannot 
find solutions within 12 hours. Thus, we set the time limit of Z3 as 1 hour. The 
execution times are shown in Fig. 6.11. When N < 16, almost all test cases can be 
solved within the time limit; when N > 16, about 80% of test cases can be solved. 
Among all these test cases, the shortest execution time of OMT is 127 ms. Such a 
long execution time makes the network unable to respond to burst packets. However, 
when SAC is used to solve the same test cases, the longest execution time is only 
0.19 ms. Even for the complicated test cases used in the following evaluations, 
the execution time of SAC is less than 3.7 ms, and the amount of memory space 
required is less than 1.2 MB. Therefore, SAC can quickly respond to burst packets 
and improve the flexibility of industrial networks. 

Then, we increase the complexity of test cases. The parameter settings in 
Fig. 6.12 are < [40, 100], 4, 10, 80 >. For each parameter setting, 1000 test cases 
are randomly generated. When N > 70, SAC has a higher schedulable ratio than 
T4 because SAC allows packets to share resources. Although this causes some 
low-criticality packets to fail to be sent, compared to FFDH, SAC discards only 
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Fig. 6.11 Execution times of Z3. (a) N = 14. (b) N = 16. (c) N = 18 


1600 
E 1400 |—e—SAC 


| -3I-- FFDH 1200 ||-43--FFDH 


HG 
e 
e 
o 


800 
600 
400 
200 


schedulable ratio 
objective 


Fig. 6.12 Comparison under varying N. (a) Schedulable ratio. (b) Objective value 


34% of packets, i.e., SAC makes a trade-off between schedulability and reliability. 
Therefore, in a network with limited resources, to schedule more high-criticality 
packets, SAC is the best choice. 

In Fig.6.13, we change X to illustrate its effect on schedulable ratios and 
objective values. The parameter settings are < (60, 70}, (3, 4, 5}, 10, 80 >. To show 
the results more clearly, the objective values of SAC under N = 70 and X = 4 
are displayed in Fig. 6.14. Due to the limited resources, some packets have to be 
covered. Hence, the objective values of some test cases are greater than zero. In 
Fig. 6.13, as X increase, the schedulable ratio decreases, and the objective value 
increases. This is because more retransmissions lead to a more serious lack of 
resources. Furthermore, if a test case contains more packets than others, it will be 
more severely affected by X. For example, Fig. 6.13b and d (N = 70) has greater 
fluctuations than Fig. 6.13a and c (N — 60). Thus, if there are many packets in a 
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network, X should be carefully determined based on the signal quality and should 
be as small as possible. 

The parameter settings in Figs.6.15 and 6.16 are « 80,4,[8, 16], 80 — and 
« 80,4, 10, [70, 95] >, respectively. As L and P increase, the schedulable ratio 
increases, and the objective value decreases. This is because the more resources, the 
easier it is to schedule packets. In Figs. 6.15 and 6.16, when their total resources 
are the same, their schedulable ratios and objective values are similar. For example, 
in Fig. 6.15, when L = 11, the amount of resources is 880, and the schedulable 
ratio and objective value are 95% and 56, respectively. In Fig. 6.16, if the amount of 
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Fig. 6.15 Comparison under varying L. (a) Schedulable ratio. (b) Objective 


resources is 880, then P is 88. When P — 88, the schedulable ratio and objective 
value are 9496 and 58, respectively. Therefore, L and P have similar effects on 
schedulability and reliability, i.e., if the time constraint P of an industrial system 
cannot be relaxed, we can improve the system performance by increasing L. 

In Fig. 6.17, we compare SAC and SACwoT3T4. SACwoT3T4 adopts binary 
search to find solutions in the whole solution space. Compared to SACwoT3T4, 
SAC reduces the solution space based on Theorems 6.3 and 6.4. Thus, SAC can 
significantly decrease the number of calls to function BasicSch(). When N « 70, 
SAC can directly find the optimal solution for almost all test cases. Therefore, 
Theorems 6.3 and 6.4 are effective. 
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Fig. 6.16 Comparison under varying P. (a) Schedulable ratio. (b) Objective 
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6.5 Summary 


This chapter focuses on the mixed-criticality scheduling problem of 5G NR. We 
present the mixed-criticality 5G NR model and formulate the problem as an 
OMT specification. Then, for the schedulability of the mixed-criticality scheduling 
problem, we analyze its sufficient condition and necessary condition. Based on the 
two conditions, we propose a scheduling algorithm. Finally, an industrial 5G testbed 
and extensive test cases are used to evaluate our proposed algorithm. The evaluation 


results indicate that our proposed algorithm can improve the real-time performance 
and reliability of 5G NR. 
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Chapter 7 ff) 
Conclusions and Future Directions Check for 


Abstract In this chapter, we summarize the book and provide three potential future 
directions for mixed-criticality industrial wireless networks. 


7.1 Conclusions 


In this book, we have introduced mixed criticality into industrial wireless networks, 
and presented analysis methods and scheduling algorithms to improve the QoS of 
industrial wireless networks. 

In Chap. 1, we presented the concept of mixed-criticality industrial wireless net- 
works. Industrial wireless networks have to meet the stringent QoS requirements of 
industrial applications. However, due to the openness of the wireless environment, 
the available network resources are limited. Mixed criticality is an advanced theory 
that makes limited resources fully utilized and can help industrial wireless networks 
improve their QoS. 

In Chap. 2, we presented an end-to-end delay analysis method for fixed priority 
scheduling in mixed-criticality WirelessHART networks, which can be used to 
determine whether all flows can be delivered to destinations within their deadlines. 
In evaluations, we compared our analysis results with simulations and a testbed. The 
results show that the pessimism of our analysis is acceptable and reliable. 

In Chap.3, we focused on the analysis method under the EDF policy. Firstly, 
we proposed a novel network model that can switch routing strategies based on 
the criticality of networks. When errors or accidents occur, the network switches 
to high-criticality mode and low-level critical tasks are abandoned. Secondly, 
we analyzed the demand bound of mixed-criticality industrial wireless networks 
under the EDF policy and formulated network demand bounds in each criticality 
mode. Thirdly, we tightened the demand bound by analyzing carry-over jobs and 
classifying the number of conflicts to improve analysis accuracy. Simulation results 
demonstrate that the presented methods can estimate the schedulability efficiently. 

Mixed-criticality data flows coexist in advanced industrial applications. They 
share the network resource, but their requirements for the real-time performance and 
reliability are different. In Chap. 4, we proposed a scheduling algorithm to guarantee 
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their different requirements, and then analyzed the schedulability for this scheduling 
algorithm. Simulation results show that our scheduling algorithm and analysis have 
more performance than existing ones. 

In Chap. 5, we first introduced MRI nodes into mixed-criticality networks. Then, 
we analyzed the transmission paths and obtained the candidate node set. Next, 
based on the characteristics of MRI nodes, we proposed the algorithm SAA and 
the algorithm PIA to guarantee the network schedulability in low-criticality mode. 
By considering system cost, these two algorithms help to reduce the number of 
MRI nodes used. Finally, we analyzed the schedulability of these two algorithms 
when the system switches to high criticality mode. The simulation results show that 
our scheduling algorithms and analysis perform better than the existing scheduling 
policy. 

In Chap. 6, we focused on the mixed-criticality scheduling problem of 5G NR. 
We presented the mixed-criticality 5G NR model and formulated the problem as an 
OMT specification. Then, for the schedulability of the mixed-criticality scheduling 
problem, we analyzed its sufficient condition and necessary condition. Based on 
the two conditions, we proposed a scheduling algorithm. Finally, an industrial 5G 
testbed and extensive test cases were used to evaluate our proposed algorithm. The 
evaluation results indicate that our proposed algorithm can improve the real-time 
performance and reliability of 5G NR. 


7.2 Future Directions 


There are many potential future directions for mixed-criticality industrial wireless 
networks. Here, we list three promising directions, as follows. 


* Criticality identifier for industrial communications. In mixed-criticality indus- 
trial wireless networks, high-criticality communications are assigned more net- 
work resources. However, which communications should be high-criticality? 
Criticality is not an inherent property of industrial communications and must 
be identified according to some rules. The identification rules determine whether 
there is a good match between industrial requirements and the goals of algo- 
rithms. If criticality levels are not correctly identified, even the optimal schedul- 
ing algorithm cannot meet industrial requirements. 

* Efficient scheduling algorithm when the criticality level is switched. On the 
one hand, the system state space is extremely complex, and there is no way to 
enumerate all the switching opportunities. On the other hand, when the criticality 
level is switched, there is no time to invoke a scheduling algorithm again. 
Under these restrictions, when the criticality level is switched, only the simple 
scheduling rule can be applied, such as preempting resources from the nearest 
low-criticality transmissions (as shown in Chaps.4, 5 and 6). There is still a 
gap between these simple rules and optimal solutions. Therefore, efficient and 
effective scheduling algorithms should be studied. 
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* Testbed supporting mixed criticality. Currently, mixed-criticality studies are 
evaluated based on simulations or real testbeds. Although the simulation is more 
flexible, it is too ideal to fully demonstrate the situation in real-world scenarios. 
A testbed can provide more comprehensive evaluations. However, there are no 
mixed-criticality hardware platforms and software protocol stacks. An easy-to- 
use testbed will facilitate research and development. 
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